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Abstract 

Most state-of-the-art satisfiability algorithms today are variants of the DPLL procedure 
augmented with clause learning. The main bottleneck for such algorithms, other than the obvi- 
ous one of time, is the amount of memory used. In the field of proof complexity, the resources 
of time and memory correspond to the length and space of resolution proofs. There has been a 
long line of research trying to understand these proof complexity measures, as well as relating 
them to the width of proofs, i.e., the size of the largest clause in the proof, which has been 
shown to be intimately connected with both length and space. While strong results have been 
proven for length and width, our understanding of space is still quite poor. For instance, it has 
remained open whether the fact that a formula is provable in short length implies that it is also 
provable in small space (which is the case for length versus width), or whether on the con- 
trary these measures are completely unrelated in the sense that short proofs can be arbitrarily 
complex with respect to space. 

In this paper, we present some evidence that the true answer should be that the latter case 
holds and provide a possible roadmap for how such an optimal separation result could be ob- 
tained. We do this by proving a tight bound of 0( v / n) on the space needed for so-called 
pebbling contradictions over pyramid graphs of size n. This yields the first polynomial lower 
bound on space that is not a consequence of a corresponding lower bound on width, as well as 
an improvement of the weak separation of space and width in (Nordstrom 2006) from logarith- 
mic to polynomial. 

Also, continuing the line of research initiated by (Ben-Sasson 2002) into trade-offs between 
different proof complexity measures, we present a simplified proof of the recent length-space 
trade-off result in (Hertel and Pitassi 2007), and show how our ideas can be used to prove a 
couple of other exponential trade-offs in resolution. 



1 Introduction 

Ever since the fundamental NP -completeness result of Cook (21], the problem of deciding whether 
a given prepositional logic formula in conjunctive normal form (CNF) is satisfiable or not has been 
on center stage in Theoretical Computer Science. In more recent years, SATISFIABILITY has gone 
from a problem of mainly theoretical interest to a practical approach for solving applied problems. 
Although all known Boolean satisfiability solvers (SAT-solvers) have exponential running time in 
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the worst case, enormous progress in performance has led to satisfiability algorithms becoming a 
standard tool for solving a large number of real-world problems such as hardware and software 
verification, experiment design, circuit diagnosis, and scheduling. 

A somewhat surprising aspect of this development is that the most successful SAT-solvers to 
date are still variants of the resolution-based Davis-Putnam-Logemann-Loveland (DPLL) proce- 
dure [25, 26] augmented with clause learning. For instance, the great majority of the best algo- 
rithms at the 2007 round of the international SAT competitions [53] fit this description. DPLL 
procedures perform a recursive backtrack search in the space of partial truth value assignments. 
The idea behind clause learning, or conflict-driven learning, is that at each failure (backtrack) point 
in the search tree, the system derives a reason for the inconsistency in the form of a new clause and 
then adds this clause to the original CNF formula ("learning" the clause). This can save a lot of 
work later on in the proof search, when some other partial truth value assignment fails for similar 
reasons. The main bottleneck for this approach, other than the obvious one of time, is the amount 
of memory used by the algorithms. Since there is only a finite amount of space, all clauses cannot 
be stored. The difficulty lies in obtaining a highly selective and efficient clause caching scheme 
that nevertheless keeps the clauses needed. Thus, understanding time and memory requirements 
for clause learning algorithms, and how these requirements are related to one another, is a question 
of great practical importance. We refer to, e.g., |9j[36l|5T| for a more detailed discussion of clause 
learning (and SAT-solving in general) with examples of applications. 

The study of proof complexity originated with the seminal paper of Cook and Reckhow ll23l . In 
its most general form, a proof system for a language L is a predicate P(x, n), computable in time 
polynomial in \x\ and \ ir\, such that for all x G L there is a string it (a proof) for which P(x,ir) = 1, 
whereas for any x L it holds for all strings 7r that P{x, tt) = 0. A proof system is said to be 
polynomially bounded if for every x G L there is a proof tt x of size at most polynomial in \x\. A 
propositional proof system is a proof system for the language of tautologies in propositional logic. 

From a theoretical point of view, one important motivation for proof complexity is the intimate 
connection with the fundamental question of P versus NP. Since NP is exactly the set of languages 
with polynomially bounded proof systems, and since TAUTOLOGY can be seen to be the dual prob- 
lem of SATISFIABILITY, we have the famous theorem of [23] that NP = CO-NP if and only if there 
exists a polynomially bounded propositional proof system. Thus, if it could be shown that there are 
no polynomially bounded proof systems for propositional tautologies, P ^ N P would follow as a 
corollary since P is closed under complement. One way of approaching this distant goal is to study 
stronger and stronger proof systems and try to prove superpolynomial lower bounds on proof size. 
However, although great progress has been made in the last couple of decades for a variety of proof 
systems, it seems that we are still very far from fully understanding the reasoning power of even 
quite simple ones. 

A second important motivation is that, as was mentioned above, designing efficient algorithms 
for proving tautologies (or, equivalently, testing satisfiability), is a very important problem not only 
in the theory of computation but also in applied research and industry. All automated theorem 
provers, regardless of whether they actually produce a written proof, explicitly or implicitly define 
a system in which proofs are searched for and rules which determine what proofs in this system 
look like. Proof complexity analyzes what it takes to simply write down and verify the proofs that 
such an automated theorem-prover might find, ignoring the computational effort needed to actually 
find them. Thus a lower bound for a proof system tells us that any algorithm, even an optimal 
(non-deterministic) one making all the right choices, must necessarily use at least the amount of a 
certain resource specified by this bound. In the other direction, theoretical upper bounds on some 
proof complexity measure give us hope of finding good proof search algorithms with respect to 
this measure, provided that we can design algorithms that search for proofs in the system in an 
efficient manner. For DPLL procedures with clause learning, the time and memory resources used 
are measured by the length and space of proofs in the resolution proof system. 
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The field of proof complexity also has rich connections to cryptography, artificial intelligence 
and mathematical logic. Some good surveys providing more details are f71 [TUl l54l . 

1.1 Previous Work 

Any formula in prepositional logic can be converted to a CNF formula that is only linearly larger 
and is unsatisfiable if and only if the original formula is a tautology Therefore, any sound and 
complete system for refuting CNF formulas can be considered as a general prepositional proof 
system. 

Perhaps the single most studied proof system in prepositional proof complexity, resolution, is 
such a system that produces proofs of the unsatisfiability of CNF formulas. The resolution proof 
system appeared in [ 16 ] and began to be investigated in connection with automated theorem proving 
in the 1960s ll25l l26l l50l . Because of its simplicity — there is only one derivation rule — and because 
all lines in a proof are clauses, this proof system readily lends itself to proof search algorithms. 

Being so simple and fundamental, resolution was also a natural target to attack when developing 
methods for proving lower bounds in proof complexity. In this context, it is most straightforward 
to prove bounds on the length of refutations, i.e., the number of clauses, rather than on the total size 
of refutations. The length and size measures are easily seen to be polynomially related. In 1968, 
Tseitin ll58l presented a superpolynomial lower bound on refutation length for a restricted form 
of resolution, called regular resolution, but it was not until almost 20 years later that Haken |[32l 
proved the first superpolynomial lower bound for general resolution. This weakly exponential 
bound of Haken has later been followed by many other strong results, among others truly expo- 
nential lower bound on resolution refutation length for different formula families in, for instance, 

A second complexity measure for resolution, first made explicit by Galil [30], is the width, 
measured as the maximal size of a clause in the refutation. Ben-Sasson and Wigderson lfl5ll showed 
that the minimal width W(F h 0) of any resolution refutation of a fe-CNF formula F is bounded 
from above by the minimal refutation length L(F h 0) by 

W(FhO) = O(^/nlogL(Fh0)) , (1.1) 

where n is the number of variables in F. Since it is also easy to see that resolution refuta- 
tions of polynomial-size formulas in small width must necessarily be short (for the reason that 
(2 • ^variables)™ is an upper bound on the total number of distinct clauses of width w), the result 
in 031 can be interpreted as saying roughly that there exists a short refutation of the A;-CNF formu- 
la F if and only if there exists a (reasonably) narrow refutation of F. This gives rise to a natural 
proof search heuristic: to find a short refutation, search for refutations in small width. It was shown 
in H4] that there are formula families for which this heuristic exponentially outperforms any DPLL 
procedure regardless of branching function. 

The formal study of space in resolution was initiated by Esteban and Toran Il28ll56l . Intuitively, 
the space Sp(n) of a resolution refutation tt is the maximal number of clauses one needs to keep 
in memory while verifying the refutation, and the space Sp (F h 0) of refuting F is defined as the 
minimal space of any refutation of F. A number of upper and lower bounds for refutation space 
in resolution and other proof systems were subsequently presented in, for example, l2l [T3ll27ll29l . 
Just as for width, the minimum space of refuting a formula can be upper-bounded by the size of the 
formula. Somewhat unexpectedly, however, it also turned out that the lower bounds on resolution 
refutation space for several different formula families exactly matched previously known lower 
bounds on refutation width. Atserias and Dalmau [ 5 ] showed that this was not a coincidence, but 
that the inequality 

W(Fh0) < Sp(Fh0) + O(l) (1.2) 
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holds for any /c-CNF formula F, where the (small) constant term depends on k. In [42], the first 
author proved that the inequality (11.21) is asymptotically strict by exhibiting a /c-CNF formula family 
of size O(n) refutable in width W(F n h 0) = 0(1) but requiring space Sp(F n h 0) = O(logn). 

The space measure discussed above is known as clause space. A less well-studied space mea- 
sure, introduced by Alekhnovich et al. 0, is variable space, which counts the maximal number of 
variable occurrences that must be kept in memory simultaneously. Ben-Sasson [1 1] used this mea- 
sure to obtain a trade-off result for clause space versus width in resolution, proving that there are 
/c-CNF formulas F n that can be refuted in constant clause space and constant width, but for which 
any refutation 7r n must have Sp(ir n ) ■ W(i: n ) = 0(n/ log n). More recently, Hertel and Pitassi ll33l 
showed that there are CNF formulas F n for which any refutation of F n in minimal variable space 
VarSp(F n h 0) must have exponential length, but by adding just 3 extra units of storage one can 
instead get a resolution refutation in linear length. 

1 .2 Questions Left Open by Previous Research 

Despite all the research that has gone into understanding the resolution proof system, a number of 
fundamental questions still remain unsolved. We touch briefly on two such questions below, and 
then discuss a third one, which is the main focus of this paper, in somewhat more detail. 

Equation (ll.ll ) says that short refutation length implies narrow refutation width. Combining 
Equation (1 1 -2b with the observation above that narrow refutations are trivially short, we get a similar 
statement that small refutation clause space implies short refutation length. Note, however, that 
this does not mean that there is a refutation that is both short and narrow, or that any small-space 
refutation must also be short. The reason is that the resolution refutations on the left- and right-hand 
sides of (11.11 ) and (11.21 ) need not (and in general will not) be the same one. 

In view of the minimum-width proof search heuristic mentioned above, an important question 
is whether short refutation length of a formula does in fact entail that there is a refutation of it that 
is both short and narrow. Also, it would be interesting to know if small space of a refutation implies 
that it is short. It is not known whether there are such connections or whether on the contrary there 
exist some kind of trade-off phenomena here similar to the one for space and width in ifTTTl . 

A third, even more interesting problem is to clarify the relation between length and clause 
space. For width, rewriting the bound in (11.11 ) in terms of the number of clauses | F n | instead of 
the number of variables we get that that if the width of refuting F n is LO^\F n \ log\F n \), then the 
length of refuting F n must be superpolynomial in \F n \. This is known to be almost tight, since 
lfT8l shows that there is a /c-CNF formula family {F n }™ =1 with W(F n h 0) = n({/\F^\) but 
L(F n h 0) = 0(| F n |). Hence, formula families refutable in polynomial length can have somewhat 
wide minimum-width refutations, but not arbitrarily wide ones. 

What does the corresponding relation between space and length look like? The inequality (11.21 ) 
tells us that any correlation between length and clause space cannot be tighter than the correlation 
between length and width, so in particular we get from the previous paragraph that /c-CNF formulas 
refutable in polynomial length may have at least "somewhat spacious" minimum-space refutations. 
At the other end of the spectrum, given any resolution refutation 7r of F in length L it can be proven 
using results from I28ll34l that Sp{ir) = 0(L/ log L). This gives an upper bound on any possible 
separation of the two measures. But is there a Ben-Sasson-Wigderson kind of upper bound on 
space in terms of length similar to (11.11 )? Or are length and space on the contrary unrelated in the 
sense that there exist /c-CNF formulas F n with short refutations but maximal possible refutation 
space Sp(F n h 0) = tt(L(F n h 0)/log L(F n h 0)) in terms of length? 

We note that for the restricted case of so-called tree-like resolution, ll28l showed that there is 
a tight correspondence between length and space, exactly as for length versus width. The case for 
general resolution has been discussed in, for instance, ifTTl [29l l5VTl . but there seems to have been no 
consensus on what the right answer should be. However, these papers identify a plausible formula 
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family for answering the question, namely so-called pebbling contradictions defined in terms of 
pebble games over directed acyclic graphs. 

1.3 Our Contribution 

The main result in this paper provides some evidence that the true answer to the question about 
the relationship between space and length is more likely to be at the latter extreme, i.e., that the 
two measures can be separated in the strongest sense possible. More specifically, as a step towards 
reaching this goal we prove an asymptotically tight bound on the clause space of refuting pebbling 
contradictions over pyramid graphs. 

Theorem 1.1. The clause space of refuting pebbling contradictions over pyramids of height h 
in resolution grows as Q(h), provided that the number of variables per vertex in the pebbling 
contradictions is at least 2. 

This yields the first separation of space and length (in the sense of a polynomial lower bound 
on space for formulas refutable in polynomial length) that is not a consequence of a corresponding 
lower bound on width, as well as an exponential improvement of the separation of space and width 
in E2. 

Corollary 1.2. For all k > 4, there is a family {F n }^ =1 of k- CNF formulas of size 0(n) that can 
be refuted in resolution in length L{F n h 0) = O(n) and width W{F n h 0) = 0(1) but require 
clause space Sp(F n h 0) = @(^/n). 

In addition to our main result, we also make the the observation that the proof of the recent 
trade-off result in [33] can be greatly simplified, and the parameters slightly improved. Using 
similar ideas, we can also prove exponential trade-offs for length with respect to clause space 
and width. Namely, we show that there are fc-CNF formulas such that if we insist on finding the 
resolution refutation in smallest clause space or smallest width, respectively, then we have to pay 
with an exponential increase in length. We state the theorem only for length versus clause space. 

Theorem 1.3. There is a family of k-CNF formulas {i ? n }^L 1 of size 0(n) such that: 

• The minimal clause space of refuting F n in resolution is Sp(F n h 0) = 0(-y/n). 

• Any resolution refutation ir : F n h in minimal clause space must have length L(ir) = 
exp(0(^)). 

• There are resolution refutations n' : F n h in asymptotically minimal clause space Sp(ir') = 
0(Sp(F n h 0)) and length L(tt') = 0(n), i.e., linear in the formula size. 

A theorem of exactly the same form can be proven for length versus width as well. 

2 Proof Overview and Paper Organization 

Since the proof of our main theorem is fairly involved, we start by giving an intuitive, high-level 
description of the proofs of our results and outlining how this paper is organized. 

2.1 Sketch of Preliminaries 

A resolution refutation of a CNF formula F can be viewed as a sequence of derivation steps on a 
blackboard. In each step we may write a clause from F on the blackboard (an axiom clause), erase 
a clause from the blackboard or derive some new clause implied by the clauses currently written 
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on the blackboard^] The refutation ends when we reach the contradictory empty clause. The length 
of a resolution refutation is the number of distinct clauses in the refutation, the width is the size of 
the largest clause in the refutation, and the clause space is the maximum number of clauses on the 
blackboard simultaneously. We write L(F h 0), W(F h 0) and Sp(F h 0) to denote the minimum 
length, width and clause space, respectively, of any resolution refutation of F. 

The pebble game played on a directed acyclic graph (DAG) G models the calculation described 
by G, where the source vertices contain the input and non-source vertices specify operations on the 
values of the predecessors. Placing a pebble on a vertex v corresponds to storing in memory the 
partial result of the calculation described by the subgraph rooted at v. Removing a pebble from v 
corresponds to deleting the partial result of v from memory. A pebbling of a DAG G is a sequence 
of moves starting with the empty graph G and ending with all vertices in G empty except for a 
pebble on the (unique) sink vertex. The cost of a pebbling is the maximal number of pebbles used 
simultaneously at any point in time during the pebbling. The pebbling price of a DAG G is the 
minimum cost of any pebbling, i.e., the minimum number of memory registers required to perform 
the complete calculation described by G. 

The pebble game on a DAG G can be encoded as an unsatisnable CNF formula Pcb d G , a so- 
called pebbling contradiction of degree d. See Figure [Qfor a small example. Very briefly, pebbling 
contradictions are constructed as follows: 

• Associate d variables x(v)\, . . . , x(v)d with each vertex v (in FigureQ]we have d = 2). 

• Specify that all sources have at least one true variable, for example, the clause x(r)i V x(r)2 
for the vertex r in Figure [TJ 

• Add clauses saying that truth propagates from predecessors to successors. For instance, for 
the vertex u with predecessors r and s, clauses 4-7 in Figure Q] are the CNF encoding of the 
implication (x(r)i V x{r)2) A (x(s)i V x(s)2) — * (x(u)i V x(u)2). 

• To get a contradiction, conclude the formula with x(z) 1 A • • • A x(z) d where z is the sink of 
the DAG. 

We will need the observation from lfl4l that a pebbling contradiction of degree d over a graph with 
n vertices can be refuted by resolution in length 0(d 2 • n) and width 0(d). 

2.2 Proof Idea for Pebbling Contradictions Space Bound 

Pebble games have been used extensively as a tool to prove time and space lower bounds and 
trade-offs for computation. Loosely put, a lower bound for the pebbling price of a graph says that 
although the computation that the graph describes can be performed quickly, it requires large space. 
Our hope is that when we encode pebble games in terms of CNF formulas, these formulas inherit 
the same properties as the underlying graphs. That is, if we pick a DAG G with high pebbling price, 
since the corresponding pebbling contradiction encodes a calculation which requires large memory 
we would like to try to argue that any resolution refutation of this formula should require large 
space. Then a separation result would follow since we already know from lfl4l that the formula can 
be refuted in short length. 

More specifically, what we would like to do is to establish a connection between resolution refu- 
tations of pebbling contradictions on the one hand, and the so-called black-white pebble game |[24l 
modelling the non-deterministic computations described by the underlying graphs on the other. 
Our intuition is that the resolution proof system should have to conform to the combinatorics of the 

'For our proof, it turns out that the exact definition of the derivation rule is not essential — our lower bound holds for 
any sound rule. What is important is that we are only allowed to derive new clauses that are implied by the set of clauses 
currently on the blackboard. 
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(x(r)i V x(r)2) 
A (x(s)i V 33(5)2) 
A (x(t)i Vx(i) 2 ) 

A (x(r) x V a;(s) x V x(ii)i V a; (14)2) 
A (x(r)j V x(s) 2 V x(n)i V x(u)2) 
A (x(r) 2 V x{s) l V x(n)i V aj(n)2) 
A (x(r) 2 V x(s) 2 V x(u)i V a?(«)2) 
A (ar(s) 1 V x(t) 1 V sc(v)i V x(v) 2 ) 
A (x(s) : V x(i) 2 V x(v)\ V x(t>) 2 ) 
A (x(s) 2 V rc(t) 1 V V x(v) 2 ) 

A (x(s) 2 V x(t) 2 V x(u )i V x(v) 2 ) 

Figure 1 : The pebbling contradiction Peb^ for the pyramid graph n 2 of height 2. 

pebble game in the sense that from any resolution refutation of a pebbling contradiction Peb d G we 
should be able to extract a pebbling of the DAG G. 

Ideally, we would like to give a proof of a lower bound on the resolution refutation space of 
pebbling contradictions along the following lines: 

1. First, find a natural interpretation of sets of clauses currently "on the blackboard" in a refuta- 
tion of the formula Peb G in terms of black and white pebbles on the vertices of the DAG G. 

2. Then, prove that this interpretation of clauses in terms of pebbles captures the pebble game 
in the following sense: for any resolution refutation of Peb G , looking at consecutive sets of 
clauses on the blackboard and considering the corresponding sets of pebbles in the graph we 
get a black-white pebbling of G in accordance with the rules of the pebble game. 

3. Finally, show that the interpretation captures clause space in the sense that if the content of 
the blackboard induces N pebbles on the graph, then there must be at least N clauses on the 
blackboard. 

Combining the above with known lower bounds on the pebbling price of G, this would imply a 
lower bound on the refutation space of pebbling contradictions and a separation from length and 
width. For clarity, let us spell out what the formal argument of this would look like. 

Consider an arbitrary resolution refutation of Peb G . From this refutation we extract a pebbling 
of G. At some point in time t in the obtained pebbling, there must be a lot of pebbles on the vertices 
of G since this graph was chosen with high pebbling price. But this means that at time t, there are a 
lot of clauses on the blackboard. Since this holds for any resolution refutation, the refutation space 
of Peb G must be large. The separation result now follows from the fact that pebbling contradictions 
are known to be refutable in linear length and constant width if d is fixed. 

Unfortunately, this idea does not quite work. In the next subsection, we describe the modifica- 
tions that we are forced to make, and show how we can make the bits and pieces of our construction 
fit together to yield Theorem [T7T] and Corollary II .21 for the special case of pyramid graphs. 



A (x(u) 1 V x(v) 1 V x{z)i V x(z) 2 ) 
A {x(u) l V x(v) 2 V x(z)i V x(z) 2 ) 
A (x(u) 2 V x(v) 1 V x(z)i V x(z) 2 ) 
A {x{u) 2 V x{v) 2 V x(z)i V x{z) 2 ) 
A x(z) l 



Ax{z) 2 
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x{u)\ V x(u)2 

x(s) 1 V x(t) 1 V x(v)i V x(v)2 
x(s) 1 V x(t) 2 V x(v)i V x(v)2 
x(s) 2 V V x(v)i V x(u)2 
x(s) 2 V x(t) 2 V V x(u)2 

(a) Clauses on blackboard. (b) Corresponding pebbles in the graph. 

Figure 2: Example of intuitive correspondence between sets of clauses and pebbles. 
2.3 Detailed Overview of Formal Proof of Space Bound 

The black-white pebble game played on a DAG G can be viewed as a way of proving the end result 
of the calculation described by G. Black pebbles denote proven partial results of the computation. 
White pebbles denote assumptions about partial results which have been used to derive other partial 
results (i.e., black pebbles), but these assumptions will have to be verified for the calculation to 
be complete. The final goal is a black pebble on the sink z and no other pebbles in the graph, 
corresponding to an unconditional proof of the end result of the calculation with any assumptions 
made along the way having been eliminated. 

Translating this to pebbling contradictions, it turns out that a fruitful way to think of a black 
pebble on v is that it should correspond to truth of the disjunction Vf=i x ( v )i °f ai l positive literals 
over v, or to "truth of v". A white pebble on a vertex w can be understood to mean that we need to 
assume the partial result on w to derive the black pebbles above w in the graph. Needing to assume 
the truth of w is the opposite of knowing the truth of w, so extending the reasoning above we get 
that a white-pebbled vertex should correspond to "falsity of w", i.e., to all negative literals x(w) i , 
i € [d\, over w. 

Using this intuitive correspondence, we can translate sets of clauses in a resolution refutation 
of PebQ into black and white pebbles in G as in Figure [2l It is easy to see that if we assume 
x(s)i V x(s)2 and x(t)i V x(t)2, this assumption together with the clauses on the blackboard 
in Figure [2(a)] imply x{v)\ V x(v)%, so v should be black-pebbled and s and t white-pebbled in 
Figure [2(b)] The vertex u is also black since x(u)i V x(u)2 certainly is implied by the blackboard. 
This translation from clauses to pebbles is arguably quite straightforward, and seems to yield well- 
behaved black-white pebblings for all "sensible" resolution refutations of Peb d G . 

The problem is that we have no guarantee that the resolution refutations will be "sensible". 
Even though it might seem more or less clear how an optimal refutation of a pebbling contradic- 
tion should proceed, a particular refutation might contain unintuitive and seemingly non-optimal 
derivation steps that do not make much sense from a pebble game perspective. In particular, a res- 
olution derivation has no obvious reason always to derive truth that is restricted to single vertices. 
For instance, it could add the axioms xiu) i V x(v) 2 V x(z)i V x(z)2, i = 1,2, to the blackboard 
in Figu re |2(a)[ derive that the truth of s and t implies the truth of either v or z, i.e., the clauses 
x(s) i V x(t)j V x(v)\ V x(z)i V x{z)2 for i, j = 1, 2, and then erase x{u)\ V x{u)2 from the black- 
board. Although it is hard to see from such a small example, this turns out to be a serious problem 
in that there appears to be no way that we can interpret such derivation steps in terms of black and 
white pebbles without making some component in the proof idea in S ection I2T21 break down. 

Instead, what we do is to invent a new pebble game, with white pebbles just as before, but 
with black blobs that can cover multiple vertices instead of single-vertex black pebbles. A blob on 
a vertex set V can be thought of as truth of some vertex v G V. The derivation sketched in the 
preceding paragraph, resulting in the set of clauses in Figure [3(a)] will then be translated into white 
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x(s) 1 V x(t) 1 V x(v)i V x(z)i V x(z)2 
x(s) 1 V x(t) 2 V V x(z)\ V x(^)2 

x(s) 2 V x(t) 1 V x(w)i V x(z)\ V x(z)2 
x(s) 2 V x(t) 2 V x(w)i V V x{z)2 

(a) New set of clauses on blackboard. (b) Corresponding blobs and pebbles. 

Figure 3: Intepreting sets of clauses as black blobs and white pebbles. 

pebbles on s and t as before and a black blob covering both v and z in Figure [3(b)] We define rules 
in this blob-pebble game corresponding roughly to black and white pebble placement and removal 
in the usual black-white pebble game, and add a special inflation rule allowing us to inflate black 
blobs to cover more vertices. 

Once we have this blob-pebble game, we use it to construct a lower bound proof as outlined in 
Section 12.21 First, we establish that for a fairly general class of graphs, any resolution refutation 
of a pebbling contradiction can be interpreted as a blob-pebbling on the DAG in terms of which 
this pebbling contradiction is defined. Intuitively, the reason that this works is that we can use the 
inflation rule to analyze apparently non-optimal steps in the refutation. 

Theorem 2.1. Let PebQ denote the pebbling contradiction of degree d > 1 over a layered DAG G. 
Then there is a translation function from sets of clauses derived from PebQ into sets of black 
blobs and white pebbles in G such that any resolution refutation ir of Peb G corresponds to a blob- 
pebbling V n of G under this translation. 

In fact, the only property that we need from the layered graphs in Theorem 12. H is that if w is a 
vertex with predecessors u and v, then there is no path between the siblings u and v. The theorem 
holds for any DAG satisfying this condition. 

Next, we carefully design a cost function for black blobs and white pebbles so that the cost of 
the blob-pebbling V n in Theorem 12. H is related to the space of the resolution refutation n. 

Theorem 2.2. If V is a refutation of a pebbling contradiction Peb G of degree d > 1, then the cost 
of the associated blob-pebbling is bounded by the space of it by COSt(V w ) < Sp(n) + 0(1). 

Without going into too much detail, in order to make the proof of Theorem l2.2l work we can only 
charge for black blobs having distinct lowest vertices (measured in topological order), so additional 
blobs with the same bottom vertices are free. Also, we can only charge for white pebbles below 
these bottom vertices. 

Finally, we need lower bounds on blob-pebbling price. Because of the inflation rule in com- 
bination with the peculiar cost function, the blob-pebble game seems to behave rather differently 
from the standard black-white pebble game, and therefore we cannot appeal directly to known 
lower bounds on black-white pebbling price. However, for a more restricted class of graphs than in 
Theorem 12. II but still including binary trees and pyramids, we manage to prove tight bounds on the 
blob-pebbling price by generalizing the lower bound construction for black- white pebbling in ll37l . 

Theorem 2.3. Any so-called layered spreading graph G^ of height h has blob-pebbling price @(h). 
In particular, this holds for pyramid graphs Il/j. 

Putting all of this together, we can prove our main theorem. 
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Theorem 11.11 (restated). Let Pebf^ h denote the pebbling contradiction of degree d > 1 defined 
over the pyramid graph of height h. Then the clause space of refuting Pebf^ by resolution is 
Sp(Pebf lh ^O) = e(h). 

Proof The upper bound Sp{Peb^ h 0) = 0(h) is easy. A pyramid of height h can be pebbled 
with h + 0(1) black pebbles, and a resolution refutation can mimic such a pebbling in constant 
extra clause space (independent of d) to refute the corresponding pebbling contradiction. 

The interesting part is the lower bound. Let ir be any resolution refutation of Pebfi h . Con- 
sider the associated blob-pebbling V n provided by Theorem 12.11 On the one hand, we know 
that C0St(V n ) =0(Sp(n)) by Theorem provided that d > 1. On the other hand, Theo- 
rem [23] tells us that the cost of any blob-pebbling of 11^ is Q(h), so in particular we must have 
COSt^jr) = Q(h). Combining these two bounds on C0St(V n ), we see that Sp(-K) = Q,(h). □ 

The pebbling contradiction Peb^ is a (2+<f)-CNF formula and for constant d the size of the 
formula is linear in the number of vertices of G (compare Figure [T]). Thus, for pyramid graphs Yi^ 
the corresponding pebbling contradictions Pebfi h have size quadratic in the height h. Also, when 
d is fixed the upper bounds mentioned at the end of Section |2~T1 become L(Peh d G h 0) = O(n) 
and W{Peb% h 0) = 0(1). Corollary O now follows if we set F n = Peb^ h for d = k - 2 and 
h = [\^n\ and use Theorem ll.il 

Corollary 11.21 (restated). For all k > 4 there is a family of k- CNF formulas {F n }^ =1 of size 0(n) 
such that L(F n h 0) = O(n) and W{F n h 0) = 0(1) but Sp(F n h 0) = 0(Vn). 

2.4 Overview of Trade-off Results 

Let us also quickly sketch the ideas (or tricks, really) used to prove our trade-off theorems for 
resolution. 

We show the following version of the length-variable space trade-off theorem of Hertel and 
Pitassi ll33l . with somewhat improved parameters and a very much simpler proof. 

Theorem 2.4. There is a family of CNF formulas {F n }^ =1 of size O(n) such that: 

• The minimal variable space of refuting F n in resolution is VarSp(F n h 0) = O(n). 

• Any resolution refutation it : F n h in minimal variable space has length exp(f2(y / n)). 

• Adding at most 2 extra units of storage, it is possible to obtain a resolution refutation it' in 
variable space VarSp(ir') = VarSp(F n h 0) + 3 = Q(n) and length L(tt') = 0(n), i.e., 
linear in the formula size. 

The idea behind our proof is as follows. Take formulas G n that are really hard for resolution and 
formulas H m which have short refutations but require linear variable space, and set F n = G n /\H m 
for m chosen so that VarSp (H m h 0) is only just larger than VarSp (G n h 0) . Then refutations in 
minimal variable space will have to take care of G n , which requires exponential length, but adding 
one or two literals to the memory we can attack H m instead in linear length. 

The trade-off result in Theorem 11.31 for length versus clause space and its twin theorem for 
length versus width are shown using similar ideas. 

2.5 Paper Organization 

Section [3]provides formal definitions of the concepts introduced in Sections Q] and |2j and Section|4] 
gives precise statements of the results mentioned there, as well as some other result relevant to this 
paper. The easy proofs of our trade-off theorems are then immediately presented in Section [5] 

The bulk of the paper is spent proving our main result in Theorem 11.11 In Section [6l we 
define our modified pebble game, the "blob-pebble game", that we will use to analyze resolution 
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refutations of pebbling contradictions. In Section [7J we prove that resolution refutations can be 
translated into pebblings in this game, which is Theorem 12.11 in Section 12.31 In Section [U we 
prove Theorem 12.21 saying that the blob-pebbling price accurately measures the clause space of the 
corresponding resolution refutation. Finally, after giving a detailed description of the lower bound 
on black- white pebbling of 0711 in Section [9] (with a somewhat simplified proof that might be of 
independent interest), in Section[lO]we generalize this result in a nontrivial way to our blob-pebble 
game. This gives us Theorem 12.31 Now Theorem 11.11 and Corollary 11.21 follow as in the proofs 
given at the end of Section [2731 

We conclude in Section [TUby giving suggestions for further research. 



In this section, we define resolution, pebble games and pebbling contradictions. 

3.1 The Resolution Proof System 

A literal is either a prepositional logic variable or its negation, denoted x and x, respectively. We 
define x = x. Two literals a and b are strictly distinct if a ^ b and a ^b, i.e., if they refer to 
distinct variables. 

A clause C = a\ V • • • V is a set of literals. Throughout this paper, all clauses C are assumed 
to be nontrivial in the sense that all literals in C are pairwise strictly distinct (otherwise C is trivially 
true). We say that C is a subclause of D if C C D. A clause containing at most k literals is called 
a k-clause. 

A CNF formula F = C\ A ■ • ■ A C m is a set of clauses. A k-CNF formula is a CNF formula 
consisting of A;-clauses. We define the size S{F) of the formula F to be the total number of literals 
in F counted with repetitions. More often, we will be interested in the number of clauses \F\ of F. 

In this paper, when nothing else is stated it is assumed that A, B, C, D denote clauses, C, O sets 
of clauses, x, y prepositional variables, a, 6, c literals, a, (3 truth value assignments and v a truth 
value or 1. We write 



to denote the truth value assignment that agrees with a everywhere except possibly at x, to which 
it assigns the value v. We let Vars(C) denote the set of variables and Lit(C) the set of literals in 
a clause C@ This notation is extended to sets of clauses by taking unions. Also, we employ the 
standard notation [n\ = {1, 2, . . . , n}. 

A resolution derivation -k : F h A of a clause A from a CNF formula F is a sequence of clauses 
7T = {D\, . . . , D T } such that D T = A and each line Di, i G [r], either is one of the clauses in F 
(axioms) or is derived from clauses Dj , in it with j,k < i by the resolution rule 



We refer to (13.21 ) as resolution on the variable x and to B V C as the resolvent of B V x and C V x 
on x. A resolution refutation of a CNF formula F is a resolution derivation of the empty clause 
(the clause with no literals) from F. Perhaps somewhat confusingly, this is sometimes also referred 
to as a resolution proof of F. 

For a formula F and a set of formulas Q = {G\, . . . , G n }, we say that Q implies F, denoted 
Q N F, if every truth value assignment satisfying all formulas G G Q satisfies F as well. It is 

2 Although the notation Lit(C) is slightly redundant given the definition of a clause as a set of literals, we include it 
for clarity. 



3 Formal Preliminaries 




(3.1) 
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well known that resolution is sound and implicationally complete. That is, if there is a resolution 
derivation 7r : F V- A, then F \= A, and if F (= A, then there is a resolution derivation tt : F h A' for 
some A' C A. In particular, F is unsatisfiable if and only if there is a resolution refutation of F. 

With every resolution derivation it : F h A we can associate a DAG G^, with the clauses in it 
labelling the vertices and with edges from the assumption clauses to the resolvent for each applica- 
tion of the resolution rule (I3.2I ). There might be several different derivations of a clause C in 7r, but 
if so we can label each occurrence of C with a timestamp when it was derived and keep track of 
which copy of C is used where. A resolution derivation it is tree-like if any clause in the derivation 
is used at most once as a premise in an application of the resolution rule, i.e., if is a tree. (We 
may make different "time-stamped" vertex copies of the axiom clauses in order to make G w into a 
tree). 

The length L(tt) of a resolution derivation it is the number of clauses in it. We define the length 
of deriving a clause A from a formula F as L(F h A) = min^jh a{L(tt)}, where the minimum 
is taken over all resolution derivations of A. In particular, the length of refuting F by resolution is 
denoted L(FhO). The length of refuting F by tree-like resolution L%{F h 0) is defined by taking 
the minimum over all tree-like resolution refutations ttt of F. 

The width W(C) of a clause C is |C|, i.e., the number of literals appearing in it. The width of 
a set of clauses C is W(C) = maxc G c{ W(C)}. The width of deriving A from F by resolution is 
W(F \- A) = mm^.p^Al W(ir)}, and the width of refuting F is denoted W(F h 0). Note that 
the minimum width measures in general and tree-like resolution coincide, so it makes no sense to 
make a separate definition for W%(F h 0). 

We next define the measure of space. Following the exposition in [28 ], a proof can be seen as 
a Turing machine computation, with a special read-only input tape from which the axioms can be 
downloaded and a working memory where all derivation steps are made. The clause space of a 
resolution proof is the maximum number of clauses that need to be kept in memory simultaneously 
during a verification of the proof. The variable space is the maximum total space needed, where 
also the width of the clauses is taken into account. 

For the formal definitions, it is convenient to use an alternative definition of resolution intro- 
duced in (21. 

Definition 3.1 (Resolution). A clause configuration C is a set of clauses. A sequence of clause 
configurations {Co, . . . , C r } is a resolution derivation from a CNF formula F if Co = and for 
all t G [t], Cf is obtained from Q_i by onedof the following rules: 

Axiom Download Cf = Ct_i U {C} for some C G F. 

Erasure C< = Ct-i \ {C} for some C G Ct-i. 

Inference Q = Q_i U {D} for some D inferred by resolution from C\, C*2 G Ct-\. 

A resolution derivation it : F h A of a clause A from a formula F is a derivation {Co, • • • , C T } 
such that C r = {^4}. A resolution refutation of F is a derivation of the empty clause from F. 

Definition 3.2 (Clause space BUHl). ^ ne c ^ ause space of a resolution derivation tt = {Co , ■ • • ,C r } 

is max te r T ]{|Ci|}. The clause space of deriving A from F is Sp(F h A) = mm^.p |_ a{Sp(tt)} , 
and Sp (F h 0) denotes the minimum clause space of any resolution refutation of F. 

Definition 3.3 (Variable space [2|). The variable space of a configuration C is VarSp(C) = 
2~^ceC W{C). The variable space of a derivation {Co, ■ • • ,C T } is max tg [ T ]{ VarSp(Ct)}, and 
VarSp (F h 0) is the minimum variable space of any resolution refutation of F. 

3 In some previous papers, resolution is defined so as to allow every derivation step to combine one or zero applications 
of each of the three derivation rules. Therefore, some of the bounds stated in this paper for space as defined next are off 
by a constant as compared to the cited sources. 
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Restricting the resolution derivations to tree-like resolution, we get the measures Sp^(F h 0) 
and VarSp<x(F h 0) in analogy with L%(F h 0) defined above. 

Note that if one wanted to be really precise, the size and space measures should probably 
measure the number of bits needed rather than the number of literals. However, counting literals 
makes matters substantially cleaner, and the difference is at most a logarithmic factor anyway. 
Therefore, counting literals seems to be the established way of measuring formula size and variable 
space. 

In this paper, we will be almost exclusively interested in the clause space of general resolution 
refutations. When we write simply "space" for brevity, we mean clause space. 

3.2 Pebble Games and Pebbling Contradictions 

Pebble games were devised for studying programming languages and compiler construction, but 
have found a variety of applications in computational complexity theory. In connection with reso- 
lution, pebble games have been employed both to analyze resolution derivations with respect to how 
much memory they consume (using the original definition of space in [28]) and to construct CNF 
formulas which are hard for different variants of resolution in various respects (see for example 
El USE! ED). An excellent survey of pebbling up to ca 1980 is RSI . 

The black pebbling price of a DAG G captures the memory space, i.e., the number of registers, 
required to perform the deterministic computation described by G. The space of a non-deterministic 
computation is measured by the black-white pebbling price of G. We say that vertices of G with 
indegree are sources and that vertices with outdegree are sinks or targets. In the following, 
unless otherwise stated we will assume that all DAGs under discussion have a unique sink and 
this sink will always be denoted z. The next definition is adapted from ll24l . though we use the 
established pebbling terminology introduced by [34]. 

Definition 3.4 (Pebble game). Suppose that G is a DAG with sources S and a unique target z. The 
black-white pebble game on G is the following one-player game. At any point in the game, there 
are black and white pebbles placed on some vertices of G, at most one pebble per vertex. A pebble 
configuration is a pair of subsets P = (B, W) of V(G), comprising the black-pebbled vertices B 
and white-pebbled vertices W. The rules of the game are as follows: 

1 . If all immediate predecessors of an empty vertex v have pebbles on them, a black pebble may 
be placed on v. In particular, a black pebble can always be placed on any vertex in S. 

2. A black pebble may be removed from any vertex at any time. 

3. A white pebble may be placed on any empty vertex at any time. 

4. If all immediate predecessors of a white-pebbled vertex v have pebbles on them, the white 
pebble on v may be removed. In particular, a white pebble can always be removed from a 
source vertex. 

A black-white pebbling from (B\, W\) to (B 2 , W 2 ) in G is a sequence of pebble configurations 
V = {P , • • • , P T } such that P = (B u W x ), P T = (B 2 , W 2 ), and for all t € [r], P t follows from 
Pt-i by one of the rules above. If {B\, W\) = (0, 0), we say that the pebbling is unconditional, 
otherwise it is conditional. 

The cost of a pebble configuration P = (B,W) is COSt(¥) = \B U W\ and the cost of a 
pebbling V = {Fq, . . . ,P r } is maxo<t< T {cosf(Pt)}. The black-white pebbling price of (B, W), 
denoted BW-Peb(B, W), is the minimum cost of any unconditional pebbling reaching (B, W). 

A complete pebbling of G, also called a pebbling strategy for G, is an unconditional pebbling 
reaching ({z},0). The black-white pebbling price of G, denoted BW-Peb(G), is the minimum 
cost of any complete black-white pebbling of G. 
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A black pebbling is a pebbling using black pebbles only, i.e., having Wt = for all t. The 
(black) pebbling price of G, denoted Peb(G), is the minimum cost of any complete black pebbling 
of G. 

We think of the moves in a pebbling as occurring at integral time intervals t = 1,2,... and talk 
about the pebbling move "at time t" (which is the move resulting in configuration P t ) or the moves 
"during the time interval [£i, £2]". 

The only pebblings we are really interested in are complete pebblings of G. However, when we 
prove lower bounds for pebbling price it will sometimes be convenient to be able to reason in terms 
of partial pebbling move sequences, i.e., conditional pebblings. 

A pebbling contradiction defined on a DAG G encodes the pebble game on G by postulating the 
sources to be true and the target to be false, and specifying that truth propagates through the graph 
according to the pebbling rules. The definition below is a generalization of formulas previously 
studied in lfT7ll49l 

Definition 3.5 (Pebbling contradiction JTH). Suppose that G is a DAG with sources S, a unique 
target z and with all non-source vertices having indegree 2, and let d > be an integer. Asso- 
ciate d distinct variables x(v)i, . . . , x{v)d with every vertex v G V(G). The dth degree pebbling 
contradiction over G, denoted Peb d G , is the conjunction of the following clauses: 

• Vf=i x ( s )i f° r all s G 5 (source axioms), 

• x(z) i for all i G [d] (target axioms), 

• x(u) i V x(v)j V Vf=i x ( w )i f° r all hj S [d] and all w G V(G) \ S, where u, v are the two 
predecessors of w (pebbling axioms). 

The formula Peb% is a (2+d)-CNF formula with 0{d 2 ■ \V(G)\) clauses over d ■ \V(G)\ vari- 
ables. An example pebbling contradiction is presented in Figure [T~on page 7 

4 Review of Related Work 

This section is an overview of related work, including formal statements of some previously known 
results that we will need. At the end of Section 14.31 we also try to provide some of the intuition 
behind the result proven in this paper. 

4.1 General Results About Resolution 

It is not hard to show that any CNF formula F over n variables is refutable in length 2 n+1 — 1 and 
width n. Esteban and Toran [28 ] proved that the clause space of refuting F is upper-bounded by the 
formula size. More precisely, the minimal clause space is at most the number of clauses, or the num- 
ber of variables, plus a small constant, or in formal notation Sp (F h 0) < min { | -F| , | Vars (F) \ } + 
0(1). 

We will need the fact that there are polynomial-size families of fc-CNF formulas that are very 
hard with respect to length, width and clause space, essentially meeting the upper bounds just 
stated. 

Theorem 4.1 ((2j |8j [13J HU |20j HU |59l). There are arbitrarily large unsatisfiable 3-CNF formu- 
las F n of size 0(n) with 0(n) clauses and 0(n) variables for which it holds that L(F n h 0) = 
exp(0(n)), W(F n h 0) = 0(n) and Sp(F n h 0) = 0(n). 
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Clearly, for such formulas F n it must also hold that Q(n) = VarSp(F n h 0) = 0(n 2 ). We 
note in passing that determining the exact variable space complexity of a formula family as in 
Theorem 14. ll was mentioned as an open problem in 0. To the best of our knowledge this problem 
is still unsolved. 

If a resolution refutation has constant width, it is easy to see that it must be of size polynomial in 
the number of variables (just count the maximum possible number of distinct clauses). Conversely, 
if all refutations of a formula are very wide, it seems reasonable that any refutation of this formula 
must be very long as well. This intuition was made precise by Ben-Sasson and Wigderson [ 15]. We 
state their theorem in the more explicit form of Segerlind [54]. 

Theorem 4.2 ( 1151 ). The width of refuting a CNF formula F is bounded from above by 

W{F h 0) < W{F) + 1 + 3v/nlnL(Fh 0) , 
where n is the number of variables in F. 

Bonet and Galesi lfl"8l showed that this bound on width in terms of length is essentially optimal. 
For the special case of tree-like resolution, however, it is possible get rid of the dependence of the 
number of variables and obtain a tighter bound. 

Theorem 4.3 ([15 1). The width of refuting a CNF formula F in tree-like resolution is bounded from 
above by W{F h 0) < W(F) + log L%(F h 0). 

For reference, we collect the result in |[T8Tl together with some other bounds showing that there 
are formulas that are easy with respect to length but moderately hard with respect to width and 
clause space and state them as a theorem^ 

Theorem 4.4 ([2, 18 , 55 1). There are arbitrarily large unsatisfiable 3- CNF formulas F n of size 
0(n 3 ) with 0(n 3 ) clauses and®(n 2 } variables such that W(F n h 0) = @(n) and Sp(F n h 0) = 
B(n), but for which there are resolution refutations ir n : F n h in length L(jT ri ) = 0(n 3 ), width 
W(ir n ) = O(n) and clause space Sp(ir n ) = O(n). 

As was mentioned above, the fact that all known lower bounds on refutation clause space co- 
incided with lower bounds on width lead to the conjecture that the width measure is a lower bound 
for the clause space measure. This conjecture was proven true by Atserias and Dalmau lISl . 

Theorem 4.5 (|[5|). For any CNF formula F, it holds that Sp{Fh 0) - 3 > W(F h 0) - W(F). 

In other words, the extra clause space exceeding the minimum 3 needed for any resolution 
derivation is bounded from below by the extra width exceeding the width of the formula. This 
inequality was later shown by the first author to be asymptotically strict in the following sense. 

Theorem 4.6 (|42l). For all k > 4, there is a family {Fn}^^ of k-CNF formulas of size 0(n) 
such that L(F n h 0) = O(n) and W(F n h 0) = 0(1) but Sp{F n h 0) = 6(logn). 

An immediate corollary of Theorem 14.51 is that for polynomial-size fc-CNF formulas constant 
clause space implies polynomial proof length. We are interested in finding out what holds in the 
other direction, i.e., if upper bounds on length imply upper bounds on space. 

For the special case of tree-like resolution, it is known that there is an upper bound on clause 
space in terms of length exactly analogous to the one on width in terms of length in Theorem 14.31 

4 Note that 1 181, where an explicit resolution refutation upper-bounding the proof complexity measures is presented, 
does not talk about clause space, but it is straightforward to verify that the refutation there can be carried out in 
length 0(n 3 ) and clause space O(n). 
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Theorem 4.7 (|28j). For any tree-like resolution refutation tt of a CNF formula F it holds that 
Sp(ir) < riogL(vr)l + 2. In particular, Sp(F h 0) < [log L % (F h 0)] + 2. 

For general resolution, since clause space is lower-bounded by width according to Theorem l4.5l 
the separation of width and length of iPTSl in Theorem 14.41 tells us that /c-CNF formulas refutable 
in polynomial length can still have "somewhat spacious" minimum-space refutations. But exactly 
how spacious can they be? Does space behave as width with respect to length also in general 
resolution, or can one get stronger lower bounds on space for formulas refutable in polynomial 
length? 

All polynomial lower bounds on clause space known prior to this paper can be explained as im- 
mediate consequences of Theorem 14 . 5 1 applied on lower bounds on width. Clearly, any space lower 
bounds derived in this way cannot get us beyond the "Ben-Sasson-Wigderson barrier" implied by 
Theorem 14.21 saying that if the width of refuting F is (^(y^F) log|F|), then the length of refuting 
F must be superpolynomial in \F\. Also, since matching upper bounds on clause space have been 
known for all of these formula families, they have not been candidates for showing stronger sepa- 
rations of space and length. Thus, the best known separation of clause space and length has been 
the formulas in Theorem 14.41 refutable in linear length L(F n h 0) = 0(|-F n |) but requiring space 
Sp (F n h0) = ( y | F n | ) , as implied by the same bound on width. 

Let us also discuss upper bounds on what kind of separations are a priori possible. Given 
any resolution refutation tt : F h 0, we can write down its DAG representation G n (described on 
page [121 with L(tt) vertices corresponding to the clauses, and with all non-source vertices having 
fan-in 2. We can then transform tt into as space-efficient a refutation as possible by considering 
an optimal black pebbling of G n as follows: when a pebble is placed on a vertex we derive the 
corresponding clause, and when the pebble is removed again we erase the clause from memory. 
This yields a refutation n' in clause space Peb(G 7T ) (incidentally, this is the original definition 
in ll28l of the clause space of a resolution refution tt). Since it is known that any constant indegree 
DAG on n vertices can be black-pebbled in cost 0(n/ log n) (see Theorem 14.101 ). this shows that 
Sp (F \- 0) = 0\L(F \- 0)/ log L(F h 0)) is a trivial upper bound on space in terms of length. 

Now we can rephrase the question above about space and length in the following way: Is there 
a Ben-Sasson-Wigderson kind of lower bound, say L(F h 0) = exp(f2(5p(.F h 0) 2 /|F|)) or 
so, on length in terms of space? Or do there exist fc-CNF formulas F with short refutations but 
maximum possible refutation space Sp (F h 0) = Q (L(F h 0)/ log L(F \- 0)) in terms of length? 
Note that the refutation length L(F h 0) must indeed be short in this case — essentially linear, since 
any formula F can be refuted in space 0(|F|) as was noted above. Or is the relation between 
refutation space and refutation length somewhere in between these extremes? 

This is the main question addressed in this paper. We believe that clause space and length can 
be strongly separated in the sense that there are formula families with maximum possible refutation 
space in terms of length. As a step towards proving this we improve the lower bound in Theorem l4.6l 
from B(logn) to 6(y / n), thus providing the first polynomial lower bound on space that is not the 
consequence of a corresponding bound on width. We next review some results about the tools that 
we use to do this. 

4.2 Results About Pebble Games 

There is an extensive literature on pebbling, mostly from the 70s and 80s. We just quickly mention 
four results relevant to this paper. 

Perhaps the simplest graphs to pebble are complete binary trees of height h. The black 
pebbling price of Th can be established by an easy induction over the tree height. For black-white 
pebbling, general bounds for the pebbling price of trees of any arity were presented in [39]. For 
the case of binary trees, this result can be simplified to an exact equality (a proof of which can be 
found in Section 4 of EH). 
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Theorem 4.8. For a complete binary tree of height h > 1 it holds that Peb(Th) = h + 2 and 
BW-Peb{T h ) = L|J + 3. 

In this paper, we will focus on pyramid graphs, an example of which can be found in Figure [T] 

Theorem 4.9 ((22j|33). For a pyramid graph TL h of height h > 1 it holds that Peb(U h ) = h + 2 
and BW-Peb{TL h ) = h/2 + 0(1). 

As we wrote in Section [2j we are interested in DAGs with as high a pebbling price as possible 
measured in terms of the number of vertices. For a DAG G with n vertices and constant in-degree, 
the best we can hope for is 0(n/ log n). 

Theorem 4.10 (|34|). For directed acyclic graphs G with n vertices and constant maximum inde- 
gree, it holds that Peb(G) = 0(n/logn). 

This bound is asymptotically tight both for black and black-white pebbling. 

Theorem 4.11 ( II3HI4710 . There is a family of explicitly constructibl^DAGs G n with 0(n) vertices 
and vertex indegrees or 2 such that Peb(G) = Q(n/ log n) and BW-Peb(G) = 6(n/ log n). 

It should be pointed out that although the black and black-white pebbling prices coincide 
asymptotically in all of the theorems above, this is not the case in general. In ll35Tl . a family of 
DAGs with a quadratic difference in the number of pebbles between the black and the black-white 
pebble game was presented. We note that this is the best separation possible, since by [40] the 
difference in black and black-white pebbling price can be at most quadratic. 

4.3 Results About Pebbling Contradictions Plus Some Intuition 

Although any constant indegree will be fine for the results covered in this subsection, we restrict 
our attention to DAGs with vertex indegrees or 2 since these are the graphs that will be studied in 
the rest of this paper. 

It was observed in [14J that Peb G can be refuted in resolution by deriving Vj=i x ( v )i f° r all 
v 6 V(G) inductively in topological order and then resolving with the target axioms x(z) i , i E [d]. 
Writing down this resolution proof, one gets the following proposition (which is proven together 
with Proposition 14. 15l below). 

Proposition 4.12 (|14|). For any DAG G with all vertices having indegree or 2, there is a reso- 
lution refutation ir : Peb G \~0 in length L(tt) = 0(d 2 ■ \V(G)\) andwidth W(ir) = 0(d). 

Tree-like resolution is good at refuting first-degree pebbling contradictions Peb}^ but is bad at 
refuting Peh d G for d > 2. 

Theorem 4.13 ( Hi ll ). For any DAG G with all vertices having indegree or 2, there is a tree-like 
resolution refutation ir of Peb)j such that L(tt) = 0(\V(G)\) and Sp(ir) = 0(1). 

Theorem 4.14 ( lH4l ). For any DAG G with all vertices having indegree or 2, Li(Peb 2 G h 0) = 
2 ^(Pefc(G)) 

As to space, it is not too difficult to see that the black pebbling price of G provides an upper 
bound for the refutation clause space of Peb G . 

Proposition 4.15. For any DAG G with vertex indegrees or 2, Sp (Peb G h 0) < Peb(G) + 0(1). 

5 This was not known at the time of the original theorems in 13 II 1471 . What is needed is an explicit construction of 
superconcentrators of linear density, and it has since been shown how to do this (with |4| apparently being the currently 
best construction). 
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Essentially, this is just a matter of combining an optimal black pebbling of G with the resolution 
refutation idea from lfl4l sketched above. Since we need the upper bounds on width and space 
in Propositions 14.121 and 14.151 in the proof of our main theorem, we write down the details for 
completeness. 

Proof of Propositions \4. 12\ and \4.15\ Consider first the bound on space. 

Given a black pebbling of G, we construct a resolution refutation of Peb G such that if at 
some point in time there are black pebbles on a set of vertices V, then we have the clauses 
{Vi=i x i v )i I v ^ ^} i n memory. When some new vertex v is pebbled, we derive \jf =l x(v)i 
from the clauses already in memory. We claim that with a little care, this can be done in con- 
stant extra space independent of d. When a black pebble is removed from v, we erase the clause 
Vi=i x(v){. We conclude the resolution proof by resolving Vf=i x i z )i f° r the target z with all 
target axioms x{z) i , i G [d], in space 3. 

It is clear that given our claim about the constant extra space needed when a vertex is black- 
pebbled, this yields a resolution refutation in space equal to the pebbling cost plus some constant. 
In particular, given an optimal black pebbling of G, we get a refutation in space Peb(G) + 0(1). 

To prove the claim, note first that it trivially holds for source vertices v, since Vf=i x ( v )i is 
an axiom of the formula. Suppose for a non-source vertex r with predecessors p and q that at 
some point in time a black pebble is placed on r. Then p and q must be black-pebbled, so by 
induction we have the clauses Vf=i x (p)i an d Vj=i x ( ( ?)i i n memory. We will use that the clause 
x (p)i V Vf=i x i r )i f° r an y i can be derived in additional space 3 by resolving Vf=i x (l)j w ith 
x(p) i V x(q)- V Vf=i x ( r )i f° r 3 £ [d], leaving the easy verification of this fact to the reader. To 
derive \/f=i x i r )h first resolve \/f=i x iv)i with x (p) l V Vf=i x ( r )i to get \/f =2 x(p)i V Vf=i x ( r )h 
and then resolve this clause with the clauses x(p) i V Vf=i x ( r )i for i = 2, . . . , d one by one to get 
Vf=i x i r )i m total extra space 4. 

It is easy to see that this proof has width 0(d), which proves the claim about width in Propo- 
sition 14.121 To get the claim about length, we observe that the subderivation needed when a 
vertex is black-pebbled has length 0(d 2 ). If we use a pebbling that black-pebbles all vertices 
once in topological order without ever removing a pebble, we get a refutation in length L(tt) = 
0(d 2 -\V(G)\). ' □ 

Thus, the refutation clause space of a pebbling contradiction is upper-bounded by the black 
pebbling price of the underlying DAG. Proposition 14. 15 l is not quite an optimal strategy with respect 
to clause space, though. For binary trees [29] improved this bound somewhat to Sp(Pebj> h \~ 0) < 
|/i + 0(1) by constructing resolution proofs that try to mimic not black pebblings but instead 
optimal black-white pebblings of as presented in j39l . And for one variable per vertex, we know 
from Theorem gJ3] that Sp{Peb l G h 0) = 0(1). 

Proving lower bounds on space for pebbling contradictions of degree d > 2 has turned out 
to be much harder. For quite some time there was no lower bound on Sp(Peb G h 0) for any 
DAG G in general resolution (in terms of pebbling price or otherwise). In ll29l . a lower bound 
Sp % (Pebi hO) = H 0(1) was obtained for the special case of tree-like resolution. Unfortu- 
nately, this does not tell us anything about general resolution. For tree-like resolution, if the only 
way of deriving a clause D is from clauses C\, C2 such that Sp^(F h C») > s, then it holds that 
Sp%(F ^~ D) > s + 1 since one of the clauses C{ must be kept in memory while deriving the 
other clause. This seems to be very different from how general resolution works with respect to 
space. In 1421 . the first author showed a lower bound Sp(Peb^ h h 0) = £l(h) for binary trees and 
d > 2, which matches the upper bound up to a constant factor. As the techniques in l42l do not 
yield anything for more general graphs, this is all that was known prior to this paper. 



18 



4 REVIEW OF RELATED WORK 



We now try to present our own intuition for what the correct lower bound on the refutation 
clause space of pebbling contradictions should be. Although the reasoning is quite informal and 
non-rigorous, our hope is that it will help the reader to navigate the formal proofs that will follow. 

As we noted above, the resolution refutation of Ptb\ in |f29l used to prove the |/i + 0(1) 
upper bound for binary tree pebbling contradictions is structurally quite similar to the optimal 
black- white pebbling of Th presented in [39], and it somehow feels implausible that any resolution 
refutation would be able to do significantly better. Also, the lower bound in [42 ] is proven by relat- 
ing resolution refutations to black-white pebblings and deriving a lower bound on clause space in 
terms of pebbling price. This raises the suspicion that the black-white pebbling price BW-Peb(G) 
might be a lower bound for Sp(PebQ h 0) also for more general graphs as long as d > 2. 

This suspicion is somewhat strengthened by the fact that for variable space, we do have such a 
lower bound in terms of black-white pebbling pricejf] 

Theorem 4.16 (mil). For any d G N+ VarSp{Peb d G h 0) > BW-Peb(G). 

If the refutation clause space of pebbling contradictions for general DAGs would be constant or 
very slowly growing, Theorem 14. 1 61 would imply that as BW-Peb(G) grows larger, the clauses in 
memory get wider, and thus weaker. Still it would somehow be possible to derive a contradiction 
from a very small number of these clauses of unbounded width. This appears counterintuitive. 

On the other hand, for one variable per vertex, i.e., d = 1, refutations of Peb G in constant 
space have exactly these "counterintuitive" properties. The resolution refutation of PebQ in Theo- 
rem [4T3] is constructed by first downloading the pebbling axiom for the target z and then moving 
the false literals downwards by resolving with pebbling axioms for vertices v G V(G) \S in reverse 
topological order. This finally yields a clause \/ veS x(v) l V x(z)± of width \S\ + 1, which can be 
eliminated by resolving with the source axioms x(v)\ one by one for all v G S and then with the 
target axiom x{z) l to yield the empty clause 0. 

If we want to establish a non-constant lower bound on Sp (Peb G h 0) for d > 2, we have to pin 
down why this case is different. Intuitively, the difference is that with only one variable per vertex, a 
single clause x(v\) 1 V ... V x{v m ) 1 can express the disjunction of the falsity of an arbitrary number 
of vertices v\,. . . ,v m , but for d = 2, the straightforward way of expressing that both variables 
x{vi)\ and x{vi)2 are false for at least one out of m vertices requires 2 m clauses. 

As was argued in Section [2 to prove a lower bound on the refutation clause space of pebbling 
contradictions it seems natural to try to interpret resolution refutations of Peb G in terms of peb- 
blings of the underlying graph G. Let us say that a vertex v is "true" if Vf=i x i v )i has been derived 
and "false" if x{v) i has been derived for all i G [d\. Any resolution proof refutes a pebbling contra- 
diction by deriving that some vertex v is both true and false and then resolving to get 0. Let w be 
any vertex with predecessors u, v. Then we can see that if we have derived that u and v are true, by 
downloading x{u) i V x(v)j V Vf=i x ( w )i f° r au h j £ [d] we can derive Vf=i x ( w )l- This appears 
analogous to the rule that if u and v are black-pebbled we can place a black pebble on w. In the op- 
posite direction, if we know x{w) l for all I G [d], using the axioms x(u) i V x(v)j V Vf=i x i w )i we 
can derive that either u or v is false. This looks similar to eliminating a white pebble on w by plac- 
ing white pebbles on the predecessors u and v, and then removing the pebble from w. Generalizing 
this loose, intuitive reasoning, we argue that a set of black-pebbled vertices V should correspond 
to the derived conjunction of truth of all v G V, and that a set of white-pebbled vertices W should 
correspond to the derived disjunction of falsity of some w G W. 

Suppose that we could show that as the resolution derivation proceeds, the black and white peb- 
bles corresponding to different clause configurations as outlined above move about on the vertices 
of G in accordance with the rules of the pebble game. If so, we would get that there is some clause 
configuration C corresponding to a lot of pebbles. This could in turn hopefully yield a lower bound 

6 To be precise, the result in flTI is for d = 1, but the proof generalizes easily to any d £ N + . 
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for the refutation clause space. For if C corresponds to N black pebbles, i.e., implies N disjoint 
clauses, it seems likely that |C| should be linear in N. And if C corresponds to N white pebbles, 
|C| should grow with N if d > 2, since C has to force d literals false simultaneously for one out of 
N vertices. 

This is the guiding intuition that served as a starting point for proving the results in this paper. 
And although quite a few complications arise along the way, we believe that it is important when 
reading the paper not to let all technical details obscure the rather simple intuitive correspondence 
sketched above. 

5 A Simplified Way of Proving Trade-off Results 

Before we launch into the proof of the main result of this paper, however, we quickly present 
our simplification of the length-space trade-off result in [33], and show how the same ideas can 
be used to prove other related theorems. We also point out two key ingredients needed for our 
proofs to work and discuss possible conclusions to be drawn regarding proving trade-off results for 
resolution. We remark that this section is a somewhat polished write-up of the results previously 
announced in ll43l . 

We will need the following easy observation. 

Observation 5.1. Suppose that F = G A H where G and H are unsatisfiable CNF formulas over 
disjoint sets of variables. Then any resolution refutation ir : F h must contain a refutation of 
either G or H. 

Proof. By induction, we can never resolve a clause derived from G with a clause derived from H, 
since the sets of variables of the two clauses are disjoint. □ 

5.1 A Proof of Hertel and Pitassi's Trade-off Result 

Using the notation in Section[3l and improving the parameters somewhat, the length-variable space 
trade-off theorem of Hertel and Pitassi ll33l can be stated as follows. 

Theorem I2.4I (restated). There is a family of CNF formulas {i ? n }^L 1 of size 0(ra) such that: 

• The minimal variable space of refuting F n in resolution is VarSp(F n h 0) = 0(n). 

• Any resolution refutation n : F n h in minimal variable space has length exp(S7(y / n)). 

• Adding at most 2 extra units of storage, one can obtain a refutation tt' in space VarSp(ir') = 
VarSp(F n h 0) + 3 = 0(n) and length L(ir') = O(n), i.e., linear in the formula size. 

We note that the CNF formulas used by Hertel and Pitassi, as well as those in our proof, have 
clauses of width 0(ra). 

Proof of Theorem \2~4\ Let G n be CNF formulas as in Theorem 14. 1 1 having size 0(n), refutation 
length L(G n h 0) = exp(f2(ra)) and refutation clause space Sp(G n h 0) = 0(n). Let us define 
g(n) = VarSp(G n h 0) to be the refutation variable space of the formulas. Then it holds that 
fi( n ) = g{ n ) = 0(n 2 ). 
Let H m be the formulas 

H m = yi A • • • A y m A {y x V • • • V y m ) . (5.1) 

It is not hard to see that there are resolution refutations 7r : H m h in length L(ir) = 2m + 1 and 
variable space VarSp(-7r) = 2m, and that L(H m h 0) = 2m + 1 and VarSp(H m h 0) = 2m are 
also the lower bounds (all clauses must be used in any refutation, and the minimum space refutation 
must start by downloading the wide clause and some unit clause, and then resolve). 
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Now define 

F n = G n A #L9(«)/2j+i (5-2) 

where G n and i?ig( n )/2j+i have disjoint sets of variables. By Observation 15.11 any resolution 
refutation of F n refutes either G n or i?ig( n )/2j+i- We have 

VarSp{H [g{n)/2i+1 h Cl) = 2 • (L<?00/2j + 1) > g{n) = VarSp{G n h 0) , (5.3) 

so a resolution refutation in minimal variable space must refute G n in length exp(f2(n)). However, 
allowing at most two more literals in memory, the resolution refutation can disprove the formula 
#Ls(n)/2j+i instead in length linear in the (total) formula size. 

Thus, we have a formula family {Fn}^^ of size n(n) = S(F n ) = 0(n 2 ) refutable in length 
and variable space both linear in the formula size, but where any minimum variable space refutation 
must have length exp(fi(n)). Adjusting the indices as needed, we get a formula family with a trade- 
off of the form stated in Theorem l2.4l □ 

5.2 Some Other Trade-off Results for Resolution 

Using a similar trick as in the previous subsection, we can prove the following length-clause space 
trade-off. 

Theorem 11.31 (restated). There is a family of k- CNF formulas {F n }^ =1 of size 0(n) such that: 

• The minimal clause space of refuting F n in resolution is Sp(F n h 0) = Q(tfn). 

• Any resolution refutation it : F n h in minimal clause space must have length L(ir) = 
exp(Q(^)). 

• There are resolution refutations n' : F n \~0 in asymptotically minimal clause space Sp(ir') = 
0(Sp(F n h 0)) and length L(tt') = 0(n), i.e., linear in the formula size. 

The same game can be played with refutation width as well. 

Theorem 5.2. There is a family of k-CNF formulas {Fn}^ =1 of size Q(n) such that: 

• The minimal width of refuting F n is W(F n h 0) = 0(-y/n). 

• Any refutation it : F n V~ in minimal width must have length L(ir) = exp(il(-y/ra)). 

• There are refutations tt' : F n h with W(ir') = 0( W(F n h 0)) and L(ir') = O(n). 

We only present the proof of Theorem ll.3l as Theorem l5.2l is proved in exactly the same manner. 

Proof of Theorem [73] Let G n be a 3-CNF formula family as in Theorem 14. 1 1 having size G(n), 
refutation length L(G n h 0) = exp(0(n)), and refutation clause space Sp(G n h 0) = 0(n). Let 
H m be a 3-CNF formula family as in Theorem l4.4l of size 0(m 3 ) such that L(H m h 0) = 0(m 3 ) 
and Sp(H m h 0) = 6(m). Define 

g(n) = mm{m \ Sp(H m h 0) > Sp(G n h 0)} . (5.4) 

Note that since Sp{H m h 0) = Q(m) and 5p(G n h 0) = O(n), we know that g(n) = O(n). 

Now as before let F n = G n A H g ^, where G n and H g ^ have disjoint sets of variables. By 
Observation 15.11 any resolution refutation of F n is a refutation of either G n or H g ( n y Since g{n) 
has been chosen so that Sp (H g ^ h 0) > Sp (G n h0),a refutation in minimal clause space has to 
refute G n , which requires exponential length. However, since g(n) = O(n), Theorem 14.41 tells us 
that there are refutations of H g ^ in length 0(n 3 ) and clause space O(n). □ 
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5.3 Making the Main Trick Explicit 

The proofs of the theorems in Sections |5. 1 1 and 15^21 come very easily; in fact almost too easily. 
What is it that makes this possible? In this and the next subsection, we want to highlight two key 
ingredients in the constructions. 

The common paradigm for the proofs of Theorems ll.3112.41 and !5.2l is as follows. We are given 
two complexity measures M\ and Mi that we want to trade off against one another. We do this by 
finding formulas G n and H m such that 

• The formulas G n are very hard with respect to the first resource measured by Mi, while 
M2 {G n ) is at most some (more or less trivial) upper bound, 

• The formulas H m are very easy with respect to M\, but there is some nontrivial lower bound 
on the usage Mi(H m ) of the second resource, 

• The index m = m(n) is chosen so as to minimize Mi{H m (n\) — Mi(G n ) > 0, i.e., so that 
H m i n \ requires just a little bit more of the second resource than G n . 

Then for F n = G n A H m r n ) , if we demand that a resolution refutation it must use the minimal 
amount of the second resource, it will have to use a large amount of the first resource. However, re- 
laxing the requirement on the second resource by the very small expression M2 (H m {n)) —Mi (G n ) , 
we can get a refutation tt' using small amounts of both resources. 

Clearly, the formula families {-Fnj^li tnat we § et i n tn ^ s wa y Me "redundant" in the sense 
that each formula F n is the conjunction of two formulas G n and H m which are themselves already 
unsatisfiable. Formally, we say that a formula F is minimally unsatisfiable if F is unsatisfiable, but 
removing any clause C £ F, the remaining subformula F \ {C} is satisfiable. We note that if we 
would add the requirement in S ections [57TI and 15^21 that the formulas under consideration should be 
minimally unsatisfiable, the proof idea outlined above fails completely. In contrast, the result in 
ll33l seems to be independent of any such conditions. What conclusions can be drawn from this? 

On the one hand, trade-off results for minimally unsatisfiable formulas seem more interesting, 
since they tell us something about a property that some natural formula family has, rather than 
about some funny phenomena arising because we glue together two totally unrelated formulas. 

On the other hand, one could argue that the main motivation for studying space is the connection 
to memory requirements for proof search algorithms, for instance algorithms using clause learning. 
And for such algorithms, a minimality condition might appear somewhat arbitrary. There are no 
guarantees that "real-life" formulas will be minimally unsatisfiable, and most probably there is no 
efficient way of testing this condition]!] So in practice, trade-off results for non-minimal formulas 
might be just as interesting. 

5.4 An Auxiliary Trick for Variable Space 

A second important reason why our proof of Theorem 12 .4 1 gives sharp results is that we are allowed 
to use CNF formulas of growing width. It is precisely because of this that we can easily construct 
the needed formulas H. m that are hard with respect to variable space but easy with respect to length. 
If we would have to restrict ourselves to &-CNF formulas for k constant, it would be much more 
difficult to find such examples. Although the formulas in Theorem l4.4l could be plugged in to give a 
slightly weaker trade-off, we are not aware of any family of /c-CNF formulas that can provably give 
the very sharp result in Theorem [23] (Note, though, that the formula families used in the proofs of 
Theorems II .3l and l5.2l consist of fc-CNF formulas). 

7 The problem of deciding minimal unsatisfiability is NP-hard but not known to be in NP. Formally, a language L is 
in the complexity class DP if and only if there are two languages L\ G NP and L2 G CO-NP such that L = L\ nZ/2 1451 . 
MINIMAL UNSATISFIABILITY is DP-complete (46), and it seems to be commonly believed that DP ^ NP U CO-NP. 
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This is not the only example of a space measure behaving badly for formulas of growing width. 
We already discussed the lower bound Sp(F h 0) > W{F h 0) - W{F) + 3 on clause space 
in terms of length in Theorem 14.51 and the result in Theorem I4.6l that this inequality is asymptot- 
ically strict in the sense that there are /c-CNF formula families F n with W(F n h 0) = 0(1) but 
Sp(F n \-0) = 0(logn). 

However, if we are allowed to consider formulas of growing width, the fact that the inequality 
in Theorem 14.51 is not tight is entirely trivial. Namely, let us say that a CNF formula F is k-wide 
if all clauses in F have size at least k. In [28], it was proven that for F a /c-wide unsatisfiable 
CNF formula it holds that Sp (F h 0) > k + 2. So in order to get a formula family F n such that 
W(F n h 0) - W(F n ) = 0(1) but Sp(F n h 0) = w(l), just pick some suitable formulas {F n }™ =1 
of growing width. 

In our opinion, these phenomena are clearly artificial. Since every CNF formula can be rewrit- 
ten as an equivalent A;-CNF formula without increasing the size more than linearly, the right ap- 
proach when studying space measures in resolution seems to be to require that the formulas under 
study should have constant width. 

As a final comment before moving on to our main result, we note that the open trade-off ques- 
tions mentioned in Section[TT]do not suffer from the technical problems discussed above. 

6 A Game for Analyzing Pebbling Contradictions 

We now start our construction for the proof of Theorem ll.il which will require the rest of this paper. 
In this section we present the modified pebble game that we will use to study the clause space of 
resolution refutations of pebbling contradictions. 

6.1 Some Graph Notation and Definitions 

We first present some notation and terminology that will be used in what follows. See Figure @]for 
an illustration of the next definition. 

Definition 6.1. We let succ(v) denote the immediate successors and pred(v) denote the immediate 
predecessors of a vertex v in a DAG G. Taking the transivite closures of succ(-) and pred(-), we 
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let G v u denote all vertices reachable from v (vertices "above" v) and G^ denote all vertices from 
which v is reachable (vertices "below" v). We write g\ and G^ to denote the corresponding sets 
with the vertex v itself removed. If pred(v) = {u, w}, we say that u and w are siblings. If u G" G'" h 
and v G" G^, we say that u and v are non-comparable vertices. Otherwise they are comparable. 

When reasoning about arbitrary vertices we will often use as a canonical example a vertex r 
with assumed predecessors pred(r) = {p, q}. 

Note that for a leaf v we have pred(v) = 0, and for the sink z of G we have succ(z) = 0. 
Also note that G" h and G^ are sets of vertices, not subgraphs. However, we will allow ourselves 
to overload the notation and sometimes use this notation both for the subgraph and its vertices. 
Moreover, as a rule we will overload the notation for the graph G itself and its vertices, and usually 
write only G when we mean V(G), and when this should be clear from context. 

For our pebble game to work, we require of the graphs under study that they have the following 
property. 

Property 6.2 (Sibling non-reachability). We say that a DAG G has the Sibling non-reachability 
property if for all vertices u and v that are siblings in G, it holds that u G' G\ and v G%, i.e., the 
siblings are not reachable from one another. 

Phrased differently, Property [6.21 asserts that siblings are non-comparable. 

A sufficient condition for Property 16.21 to hold is that if v is reachable from u, then all paths 
P : u ~» v have the same length. This holds for instance for the class of layered graphs, and it is 
also easy to see directly that layered graphs possess Property 16.21 

Definition 6.3 (Layered DAG). A layered DAG G is a DAG whose vertices are partitioned into 
(nonempty) sets of layers Vq, V\, . . . , Vh on levels 0, 1, . . . , h, and whose edges run between con- 
secutive layers. That is, if (u, v) is a directed edge, then the level of u is L — 1 and the level of v is 
L for some L G [h] . We say that h is the height of the layered DAG G. 

Throughout this paper, we will assume that all source vertices in a layered DAG are located on 
the bottom level 0. Let us next give a formal definitions of the pyramid graphs that are the focus of 
this paper. 

Definition 6.4 (Pyramid graph). The pyramid graph Uh of height h is a layered DAG with h + 1 
levels, where there is one vertex on the highest level (the sink z), two vertices on the next level et 
cetera down to h + 1 vertices at the lowest level 0. The ith vertex at level L has incoming edges 
from the ith and (i + l)st vertices at level L — 1. 

We also need some notation for contiguous and non-contiguous topologically ordered sets of 
vertices in a DAG. 

Definition 6.5 (Paths and chains). We say that V is a (totally) ordered set of vertices in a DAG G, 
or a chain, if all vertices in V are comparable (i.e., if for all u, v G V, either u G or v G G^). A 
path P is a contiguous chain, i.e., such that succ(v) n P ^ for all v G P except the top vertex. 

We write P : v ~^ w to denote a path starting in v and ending in w. A source path is a path that 
starts at some source vertex of G. A path via w is a path such that w G P. We will also say that P 
visits w. For a chain V, we let 

• bot(V) denote the bottom vertex of V, i.e., the unique v G V such that V C G^, 

• top(F) denote the top vertex of V , i.e., the unique v G V such that V C G v h , 

• ^Pin(^) denote the set of all paths P : bot(y) top(F) via V or agreeing with V, i.e., 
such that yep, and 
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• ^Pvia(^) denote the set of all source paths agreeing with V. 

We write |J ^Pin(V) to denote the union of the vertices in all paths P G ^Pin(V) and [j ^p v ia(^0 for 
the union of all vertices in paths P € tyvmiV). 

In the rest of this paper, we will almost exclusively discuss DAGs with certain structural prop- 
erties. The next definition is so that we will not have to repeat these properties over and over again. 

Definition 6.6 (Blob-pebblable DAG). A blob-pebblable DAG is a DAG that has a unique sink, 
which we will alway denote z, that has vertex indegree 2 for all non-sources, and that satisfies the 
Sibling non-reachability property 16.21 

6.2 Description of the Blob-Pebble Game and Formal Definition 

To prove a lower bound on the refutation space of pebbling contradictions, we want to interpret 
derivation steps in terms of pebble placements and removals in the corresponding graph. In Sec- 
tion[2l we outlined an intuitive correspondence between clauses and pebbles. The problem is that if 
we try to use this correspondence, the pebble configurations that we get do not obey the rules of the 
black-white pebble game. Therefore, we are forced to to change the pebbling rules. In this section, 
we present the modified pebble game used for analyzing resolution derivations. 

Our first modification of the pebble game is to alter the rule for white pebble removal so that a 
white pebble can be removed from a vertex when a black pebble is placed on that same vertex. This 
will make the correspondence between pebblings and resolution derivations much more natural. 
Clearly, this is only a minor adjustment, and it is easy to prove formally that it does not really 
change anything. 

Our second, and far more substantial, modification of the pebble game is motivated by the fact 
that in general, a resolution refutation a priori has no reason to follow our pebble game intuition. 
Since pebbles are induced by clauses, if at some derivation step the refutation chooses to erase "the 
wrong clause" from the point of view of the induced pebble configuration, this can lead to peb- 
bles just disappearing. Whatever our translation from clauses to pebbles is, a resolution proof that 
suddenly out of spite erases practically all clauses must surely lead to practically all pebbles dis- 
appearing, if we want to maintain a correspondence between clause space and pebbling cost. This 
is all in order for black pebbles, but if we allow uncontrolled removal of white pebbles we cannot 
hope for any nontrivial lower bounds on pebbling price (just white-pebble the two predecessors of 
the sink, then black-pebble the sink itself and finally remove the white pebbles). 

Our solution to this problem is to keep track of exactly which white pebbles have been used 
to get a black pebble on a vertex. Loosely put, removing a white pebble from a vertex v without 
placing a black pebble on the same vertex should be in order, provided that all black pebbles placed 
on vertices above v in the DAG with the help of the white pebble on v are removed as well. We do 
the necessary bookkeeping by defining subconfigurations of pebble configurations, each subcon- 
figuration consisting of black pebble together with all the white pebbles this black pebble depends 
on, and require that if any pebble in a subconnguration is removed, then all other pebbles in this 
subconfiguration must be removed as well. 

Another problem is that resolution derivation steps can be made that appear intuitively bad given 
that we know that the end goal is to derive the empty clause, but where formally it appears where 
hard to nail down wherein this supposed badness lies. To analyze such apparently non-optimal 
derivation steps, we introduce an inflation rule in which a black pebble can be inflated to a blob 
covering multiple vertices. The way to think of this is that a black pebble on a vertex v corresponds 
to derived truth ov v, whereas for a blob pebble on V we only know that some vertex v € V is true, 
but not which one. For reasons that will perhaps become clearer in Sections [9] and [TOl in is natural 
to consider blobs that are chains (Definition [63]). 
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We now present the formal definition of the concept used to "label" each black blob pebble 
with the set of white pebbles (if any) this black pebble is dependent on. The intended meaning 
of the notation [2?](W) is a black blob on B together with the white pebbles W below v with the 
help of which we have been able to place the black blob on B. These "associated" or "supporting" 
white pebbles can be located on any vertex w £ B that can be visited by a source path P to top (5) 
agreeing with B. Formally, the legal pebble positions with respect to a chain B with b = bot(S) 
is the set of vertices 

l PP (B) = G\ U (\JVin(B) \ B) = U<Pvia(B) \ B . (6.1) 

We refer to the structure [B] (W) grouping together a black blob B and its associated white pebbles 
W as a blob subconfiguration, or just subconfiguration for short. 

Definition 6.7 (Blob subconfiguration). For sets of vertices B, W in a blob-pebblable DAG G, 

[B\(W) is a blob subconfiguration if B ^ is a chain and W C lpp(B). We refer to B as a 
(single) black blob and to W as (a number of different) white pebbles supporting B. We also say 
that B is dependent on W. If W = 0, B is independent. Blobs B with \B\ = 1 are said to be 
atomic. 

A set of blob subconfigurations § = |[Bj](Wj) | i = 1, . . . , m\ together constitute a blob- 
pebbling configuration. 

Note in particular that it always holds that B n W = for a blob subconfiguration [B](W). 
Since the definition of the game we will play with these blobs and pebbles is somewhat in- 
volved, let us first try to give an intuitive description. 

• There is one single rule corresponding to the two rules [T] and [3] for black and white pebble 
placement in the black- white pebble game of Definition 13.41 This introduction rule says that 
we can place a black pebble on a vertex v together with white pebbles on its predecessors 
(unless v is a source, in which case no white pebbles are needed). 

• The analogy for rule |2] for black pebble removal in Definition 13.41 is a rule for "shrinking" 
black blobs. A vertex v in a blob can be eliminated by merging two blob subconfigurations, 
provided that there is both a black blob and a white pebble on v, and provided that the two 
black blobs involved in this merger do not intersect the supporting white pebbles of one 
another in any other vertex than v. Removing black pebbles in the black-white pebble game 
corresponds to shrinking atomic black blobs. 

• A black blob can be inflated to cover more vertices, as long as it does not collide with its own 
supporting white vertices. Also, new supporting white pebbles can be added at an inflation 
move. There is no analogy of this move in the usual black-white pebble game. 

• The rule [4] for white pebble removal also corresponds to merging in the blob-pebble game, 
since the white pebble used in the merger is eliminated as well. In addition, however, a white 
pebble on w can also disappear if its black blob B changes so that w no longer can be visited 
on a path via B (i.e., if w is no longer a legal pebble position with respect to B). 

• Other than that, individual white pebbles, and individual black vertices covered by blobs, can 
never just disappear. If we want to remove a white pebble or parts of a black blob, we can do 
so only by erasing the whole blob subconfiguration. 

The formal definition follows. See Figure |5]for some examples of blob-pebbling moves. 



26 



6 A GAME FOR ANALYZING PEBBLING CONTRADICTIONS 




(a) Empty pyramid. (b) Introduction move. 




(c) Two subconfigurations before merger. (d) The merged subconfiguration. 




(e) Subconfiguration before inflation. (f) Subconfiguration after inflation. 




(g) Another subconfiguration before inflation. (h) After inflation with vanished white pebbles. 

Figure 5: Examples of moves in the blob-pebble game. 
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Definition 6.8 (Blob-pebble game). For a blob-pebblable DAG G and blob-pebbling configura- 
tions So an d §>r on G, a blob-pebbling from So to §r in G is a sequence V = {So, • • • >§t} of 
configurations such that for all t £ [r], St is obtained from St-i by one of the following rules: 

Introduction S t = S t -i U {[v](pred(v))}. 

Merger 8 t = St-i U {[B](W)} if there are [Bx](Wi), [B 2 ](W 2 ) E St_i such that 

1. 5i U B 2 is (totally) ordered, 

2. S a n W 2 = 0, 

3. \B 2 fl Wi| = 1; let t>* denote this unique element in £?2 n Wi, 

4. B = (B x U B 2 ) \ {«*}, and 

5. W = ((Wi U W 2 ) \ {v*}) n lpp(B), 

We write [£?](W) = merge([.Bi](Wi), [B 2 ](W 2 )) and refer to this as a merger on v*. 
Inflation S t = St-i U {[S](W» if there is a [S'](W) E § t -i such that 

1. 5 3 S', 

2. 5 n W = 0, and 

3. W D W n ipp(S). 

We say that [B] ( W) is derived from [B'](W) by inflation or that [B'\ (W) is inflated to yield 
Erasure § t = S t -i \ {[B}(W)} for E St_i. 

The blob-pebbling V is unconditional if So = an d conditional otherwise. A complete blob- 
pebbling of G is an unconditional pebbling V ending in S r = { [z](0) } for z the unique sink of G. 

6.3 Blob-Pebbling Price 

We have not yet defined what the price of a blob-pebbling is. The reason is that it is not a priori 
clear what the "correct" definition of blob-pebbling price should be. 

It should be pointed out that the blob-pebble game has no obvious intrinsic value — its function 
is to serve as a tool to prove lower bounds on the resolution refutation space of pebbling contra- 
dictions. The intended structure of our lower bound proof for resolution space is that we want 
look at resolution refutations of pebbling contradictions, interpret them in terms of blob-pebblings 
on the underlying graphs, and then translate lower bounds on the price of these blob-pebblings 
into lower bounds on the size of the corresponding clause configurations. Therefore, we have two 
requirements for the blob-pebbling price Blob-Peb(G): 

1. It should be sufficiently high to enable us to prove good lower bounds on Blob-Peb(G), 
preferrably by relating it to the standard black-white pebbling price BW-Peb(G). 

2. It should also be sufficiently low, so that lower bounds on Blob-Peb(G) translate back to 
lower bounds on the size of the clause configurations. 

So when defining pebbling price in Definition !6.9l below. we also have to have in mind the coming 
Definition 17.21 saying how we will interpret clauses in terms of blobs and pebbles and that these 
two definitions together should make it possible for us to lower-bound clause set size in terms of 
pebbling cost. 
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For black pebbles, we could try to charge 1 for each distinct blob. But this will not work, since 
then the second requirement above fails. For the translation of clauses to blobs and pebbles sketched 
in Section 12.31 it is possible to construct clause configurations that correspond to an exponential 
number of distinct black blobs measured in the clause set size. The other natural extreme seems 
to be to charge only for mutually disjoint black blobs. But this is far too generous, and the first 
requirement above fails. To get a trivial example of this, take any ordinary black pebbling of G 
and translate in into an (atomic) blob-pebbling, but then change it so that each black pebble [v] is 
immediately inflated to [{v, z}] after each introduction move. It is straightforward to verify that this 
would yield a pebbling of G in constant cost. For white pebbles, the first idea might be to charge 1 
for every white-pebbled vertex, just as in the standard pebble game. On closer inspection, though, 
this seems to be not quite what we need. 

The definition presented below turns out to give us both of the desired properties above, and 
allows us to prove an optimal bound. Namely, we define blob-pebbling price so as to charge 1 for 
each distinct bottom vertex among the black blobs, and so as to charge for the subset of supporting 
white pebbles W n G b A in a subconfiguration [B] (W) that are located below the bottom vertex 
bot(-B) of its black blob B. Multiple distinct blobs with the same bottom vertex come for free, 
however, and any supporting white pebbles above the bottom vertex of its own blob are also free, 
although we still have to keep track of them. 

Definition 6.9 (Blob-pebbling price). For a subconfiguration [.B](W), we say that B([B](W)) = 
{bot(B)} is the chargeable black vertex and that W A ([B](W)) = W n G^ ot ^ are the chargeable 
white vertices. The chargeable vertices of the subconfiguration [B] (W) are all vertices in the union 
B([B](W)) U W A ([B](W)). This definition is extended to blob-pebbling configurations § in the 
natural way by letting 

B(S) = (J B([B](W)) = (bot(B) | [B](W) £ S} 

[B]{W)& 

and 

W A (S) = \J W A ([B](W)) = [j (w n G h A ot(B) ^ . 

[B]{W)€§ [B]{W)£S 

The cost of a blob-pebbling configuration § is COSt(S) = \B(S) U W A (S)|, and the cost of a 
blob-pebbling V = {S , . . . ,S T } is cost(V) = max t6[r ] {cosf(Sj)}. 

The blob-pebbling price of a blob subconfiguration [B}(W), denoted Blob-Peb([B](W)), is 
the minimal cost of any unconditional blob-pebbling V = {§o> • • ■ > §v} such that S r = { [B] (W) } . 
The blob-pebbling price of a DAG G is Blob-Peb(G) = Blob-Peb{[z}{%)) , i.e., the minimal cost 
of any complete blob-pebbling of G. 

We will also write W(S) to denote the set of all white-pebbled vertices in S, including non- 
chargeable ones. 

7 Resolution Derivations Induce Blob-Pebblings 

For simplicity, in this section, as well as in the next one, we will write v \ , . . . , instead of 
x(v)i, . . . , x(v)d for the d variables associated with v in a dth degree pebbling contradiction. That 
is, in Sections |7] and [8] small letters with subscripts will denote only variables in prepositional logic 
and nothing else. 

It turns out that for technical reasons, it is more natural to ignore the target axioms z±, . . . ,Zd 
and focus on resolution derivations of Vf=i z i from the rest of the formula rather than resolution 
refutations of all of PebQ. Let us write *PebQ = PebQ \ \z\, . . . , z^} to denote the pebbling 
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formula over G with the target axioms in the pebbling contradiction removed. The next lemma is 
the formal statement saying that we may just as well study derivations of V?=i z l from this pebbling 
formula *Peb G instead of refutations of Peb G . 

Lemma 7.1. For any DAG G with sink z, it holds that Sp(Peb G h 0) = Sp{*Peb G V- \jf =1 z x ). 

Proof. For any resolution derivation -k* : *Peb G h Vf=i z i> we can § et a resolution refutation of 
Peb G from ir* in the same space by resolving Vf=i z i with all zj, I = 1, . . . , d, in space 3. 

In the other direction, for 7r : Pe&g. h we can extract a derivation of Vf=i z / i n at most the 
same space by simply omitting all downloads of and resolution steps on zi in 7r, leaving the literals 
Zi in the clauses. Instead of the final empty clause we get some clause D C \/f=i z i> an d since 
*Pe&c l^flC \/f = i zi and resolution is sound, we have D = Vf =i Zi- □ 

In view of Lemma 177X1 from now on we will only consider resolution derivations from *Peb G 
and try to convert clause configurations in such derivations into sets of blob subconfigurations. 

To avoid cluttering the notation with an excessive amount of brackets, we will sometimes use 
sloppy notation for sets. We will allow ourselves to omit curly brackets around singleton sets when 
this is clear from context, writing for instance V U v instead of V U {v} and [B U b](W U w) 
instead of [B U {fr}](l^ U {w}). Also, we will sometimes omit the curly brackets around sets of 
vertices in black blobs and write, for instance, [u, v] instead of [{u, v}]. 

7.1 Definition of Induced Configurations and Theorem Statement 

If r is a non-source vertex with predecessors pred(r) = {p, q}, we say that the axioms for r in 
*Peb% is the set 

Ax d (r) = {p.Vqj V Vf=i n \i,je [d]} (7.1) 

and if r is a source, we define Ax d (r) = {Vf=i r «}- F° r V a set of vertices in G, we let Ax d (V) = 
{Ax d (v) | v £ V). Note that with this notation, we have *Peb G = {Ax d (v) \ v £ V(G)}. For 
brevity, we introduce the shorthand notation 

Hv) = {y d =lVi \vev} (7.2) 

and 

All + (V)=\J veV \ff =1 v i . (7.3) 

One can think of B(V) as "truth of all vertices in V" and All + {V) as "truth of some vertex in V". 

We say that a set of clauses C implies a clause D minimally if C (= D but for all C' S C 
it holds that C ¥ D. If C N minimally, C is said to be minimally unsatisfiahle . We say that 
C implies a clause D maximally if C 1= D but for all D' ^ D it holds that C ¥■ D'. To define 
our translation of clauses to blob subconfigurations, we use implications that are in a sense both 
minimal and maximal. We remind the reader that the vertex set lpp(B) of legal pebble positions 
for white pebbles with respect to the chain B was defined in Equation (16.11 ) on page [26] 

Definition 7.2 (Induced blob subconfiguration). Let G be a blob-pebblable DAG and C a clause 
configuration derived from *Peb G . Then C induces the blob subconfiguration [i?](Vt / ) if there is a 
clause set Cb C C and a vertex set S C G \ B with W = S n lpp(B) such that 

C B U 1(5) t= All+(B) (7.4a) 

but for which it holds for all strict subsets C B C C B , S' C S and B' C B that 

C' B U M(S) ¥ All + {B) , (7.4b) 
C B U JB(S') Y- All + {B) , and (7.4c) 
C B U M(S) P All\B') . (7.4d) 
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We write S(C) to denote the set of all blob subconfigurations induced by C. 
To save space, when all conditions (l7.4aM7.4dl) hold, we write 

C B U M(S) > All + {B) (7.5) 

and refer to this as precise implication or say that the clause set Cb U M(S) implies the clause 
All + {B) precisely. Also, we say that the precise implication Cb U M(S) > AU + (B) witnesses the 
induced blob subconnguration [B](W). 

In the following, we will use the definition of precise implication > also for clauses All + {V) 
where the vertex set V is not a chain. 

Let us see that this definition agrees with the intuition presented in Section l23l An atomic black 
pebble on a single vertex v corresponds, as promised, to the fact that Vf=i v i * s implied by the cur- 
rent set of clauses. A black blob on V without supporting white pebbles is induced precisely when 
the disjunction All + (V) = V^eV Vf=i v i °f the corresponding clauses follow from the clauses in 
memory, but no disjunction over a strict subset of vertices V' ^ V is implied. Finally, the sup- 
porting white pebbles just indicate that if we indeed had the information corresponding to black 
pebbles on these vertices, the clause corresponding to the supported black blob could be derived. 
Remember that our cost measure does not take into account the size of blobs. This is natural since 
we are interested in clause space, and since large blobs, in an intuitive sense, corresponds to large 
(i.e., wide) clauses rather than many clauses. 

The main result of this section is as follows. 

Theorem 7.3. Let it = {Co, . . . , C r } be a resolution derivation of\J^ =1 Zifrom *PebQfor a blob- 
pebblable DAG G. Then the induced blob-pebbling configurations {S(Cq), • • • , §(C r )} form the 
"backbone" of a complete blob-pebbling V of G in the sense that 

• S(Co) = 0, 

• S(C T ) = {[z}{$)}, and 

• for every t S [r], the transition S(Ct-i) §(Ct) can be accomplished in accordance with 
the blob-pebbling rules in cost max{cosf(S(Ct-i)), COS?(S(Q))} + 0(1). 

In particular, to any resolution derivation it : *PebQ h Vf=i z i we can associate a complete blob- 
pebbling V-,, ofG such that COSt(P n ) < max Ce7r {cosf(§(C))} + 0(1). 

We prove the theorem by forward induction over the derivation n. By the pebbling rules in 
Definition 16. 8 1 any subconnguration may be erased freely at any time. Consequently, we 

need not worry about subconfigurations disappearing during the transition from Q_i to Q. What 
we do need to check, though, is that no subconnguration [B] (W) appears inexplicably in §(Q) as a 
result of a derivation step Q_i Q, but that we can always derive any [B] (W) G S(Ct) \S(Q_i) 
from §(Cf_i) by the blob-pebbling rules. Also, when several pebbling moves are needed to get 
from §(Cf) to S(Ct_i), we need to check that these intermediate moves do not affect the pebbling 
cost by more than an additive constant. 

The proof boils down to a case analysis of the different possibilities for the derivation step 
Cf_i ~^ Cf. Since the analysis is quite lengthy, we divide it into subsections. But first of all we 
need some technical lemmas. 

7.2 Some Technical Lemmas 

The next three lemmas are not hard, but will prove quite useful. We present the proofs for com- 
pleteness. 
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Lemma 7.4. Let C be a set of clauses and D a clause such that CND minimally and a G Lit(C) 
but a G" Lit(C). Then a G Lit(D). 

Proof. Suppose not. Let Ci = {C G C | a G Lii(C)} and C 2 = C \ Ci. Since C 2 ^ -D there 
is a truth value assignment a such that a(C 2 ) = 1 and a(D) = 0. Note that a{a) = 0, since 
otherwise a(Ci) = 1 which would contradict Ci U C 2 = C 1= D. It follows that a £ Lit(D). Flip 
a to true and denote the resulting truth value assignment by a a=1 . By construction a a=1 (Ci) = 1 
and C 2 and D are not affected since {a, a} n (Lit(C 2 ) U Lit(D)) = 0, so a a=1 (C) = 1 and 
a a=1 (L>) = 0. Contradiction. □ 

Lemma 7.5. Suppose that C, D are clauses and C is a set of clauses. Then C U {C} 1= D if and 
only ifC\=a\J D for all a G Lit(C). 

Proof. Assume that C U {C} N D and consider any assignment a such that a(C) = 1 and 
= (if there is no such a, then C\= D C a V D). Such an a must set C to false, i.e., all a 
to true. Conversely, if C t= a V D for all a G Lit(C) and a is such that a(C) = a(C) = 1, it must 
hold that a(D) = 1, since otherwise a(a V D) = for some literal a G Lit(C) satisfied by a. □ 

Lemma 7.6. Suppose that C \= D minimally. Then no literal from D can occur negated in C, i.e., 
it holds that {a \ a G Lit(D)} n £if(C) = 0. 

Proof. Suppose not. Let Ci = {C G C | 3a such that a G Lit(C) and a G Lit(D)} and C 2 = 
C \ Ci. Since C 2 ^ D there is an a such that a(C 2 ) = 1 and a(L>) = 0. But then a(C x ) = 1, 
since every C G Ci contains a negated literal a from D, and these literals are all set to true by a. 
Contradiction. □ 

We also need the following key technical lemma connecting implication with inflation moves. 

Lemma 7.7. Let C be a clause set derived from *PebQ. Suppose that B is a chain and that 
S C G\B is a vertex set such that C U JB(S) 1= All + (B) and letW = S(l lpp(B). Then the blob 
subconfiguration \B\(W) is derivable by inflation from some \B'\(W') G S(C). 

Proof. Pick C'CC,5'CS and B' C B minimal such that C U 1(5') N All + (B'). Then 
C U M(S') > >1// + (B') by definition. Note, furthermore, that B' ^ since the clause set on 
the left-hand side must be non-contradictory. Also, C / since B' n 5" C S n 5 = 0, 
so by Lemma El it cannot be that M(S') t= AU + (B'). This means that C induces for 
W' = 5' n Ipp(B'). We claim that [B'](W) can be inflated to [B]{W), from which the lemma 
follows. 

To verify this claim, note that first two conditions B' C B and S n W C S n 5 = for 
inflation moves in Definition 16. 81 clearly hold by construction. As to the third condition, we get 

W n ipp(B) = (s' n ipp(B')) n lpp(B) csn i PP (B) = w 

which proves the claim. □ 

We now start the case analysis in the proof of Theorem 17. 31 for the different possible derivation 
steps in a resolution derivation. 

7.3 Erasure 

Suppose that Q = Q_i \ {C} for C G C-t-x- It is easy to see that the only possible outcome 
of erasing clauses is that blob subconfigurations disappear. We note for future reference that this 
implies that the blob-pebbling cost decreases monotonically when going from S(Ct-i) to §(Q). 
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7.4 Inference 

Suppose that Q = Q_i U {C} for some clause C derived from No blob subconfigurations 
can disappear at an inference move since Q_i C C t . Suppose that is a new subcon- 

figuration at time t arising from Cg C Q_i and S C G \ B such that W = S n lpp(B) and 
C B U {C} U 1(5) D> AU + (B). Since C is derived from C t _i, we have Q_i N C. Thus it 
holds that C t _i U B(5) N All + (B) and Lemma tells us that [B]{W) is derivable by inflation 
fromS(C t _i). 

Since no subconfiguration disappears, the pebbling cost increases monotonically when going 
from S(Ct_i) to S(Q) for an inference step, which is again noted for future reference. 

7.5 Axiom Download 

This is the interesting case. Assume that a new blob subconfiguration [B] (W) is induced at time t 
as the result of a download of an axiom C G Ax (r). Then C must be one of the clauses inducing 
the subconfiguration, and we get that there are C Ct-i and S C G\B with W = S n lpp(B) 
such that 



Our intuition is that download of an axiom clause C G Ax (r) in the resolution derivation should 
correspond to an introduction of [r](pred(r)) in the induced blob-pebbling. We want to prove 
that any other blob subconfiguration [.£?]( in S(Q) is derivable by the pebbling rules from 
§(Cf_i) U [r](pred(r)). Also, we need to prove that the pebbling moves needed to go from 
§(Q_i) to S(Ct) do not increase the blob-pebbling cost by more than an additive constant com- 
pared to max{cosf(S(Ci_i)), COSf(S(Ct))} = COSt(S(C t )). 

We do the proof by a case analysis over r depending on where in the graph this vertex is located 
in relation to B. To simplify the proofs for the different cases, we first show a general technical 
lemma about pebble induction at axiom download. 

Lemma 7.8. Suppose that Q = Q_i U C for an axiom C 6 Ax d (r) and that [B](W) is a new 
blob subconfiguration induced at time t as witnessed by (17.6I ). Then it holds that: 

1. r£S. 

2. pred(r) nB = l 

3. If r £ B, then Ct-i induces [B](W U ({r} D lpp(B))) if r is a source, and otherwise this 
subconfiguration can be derived from S(Ct_x) by inflation. 

4. Ifr is a non-source vertex and v £ pred(r) is such that v G lpp(B) \ S, then we can derive 
[B U v](S n lpp(B U v)) from S(C t -i) by inflation. 

Proof. Suppose that \B)(W) G S(C t ) \ S(C*_i). For part ffl noting that IB (r) 1= C for C G Ax d (r) 
we see that r ^ S, as otherwise the implication (17.61 ) cannot be precise since C can be omitted. 

If r is a source part|2]is trivial, so suppose pred(r) = {p, q} and C = p i V ■ V Vf=i r i- Then 
it follows from Lemma I7T61 that {p, q} n B = 0. 

For part[3l if r is a source, we have C = Vf=i r i an ^ <I7.6I > becomes 



C B U {C} U B(5) > All\B) 



(7.6) 



C B U 1(5 U r) > AZ/+(B) 



(7.7) 



for 5 U r C G \ B, which shows that Ct_i induces 



[B]<(5 U r) n ipp(S)) = [B]<(5 n lpp(B)) U (r n lpp(B))} 

= [B]((W U (r n lpp(B))) . 



(7.8) 



33 



TOWARDS AN OPTIMAL SEPARATION 



If r is a non-source we do not get a precise implication but still have 

Cb U 1(5 U r) t= (7.9) 

and Lemma O yields that [B]((S U r) n lpp(B)) = [B](W U (r n lpp(B))) is derivable by 
inflation from §(Ct_i). 

If u G pred(r) in part|4j the downloaded axiom can be written on the form C = C V «j. 
Applying Lemma 1731 on (17.61 ) we get 

C B U 1(5) (= ,4Z/ + (£) V Vi C U u) . (7.10) 

By assumption, we have that B U u is a chain and that S C G \ (-B U t> ), so Lemma|7j7jsays that 
[5 U fl lpp(B U «)) is derivable from S(C t _i) by inflation. □ 

What we get from Lemma P7781 is not in itself sufficient to derive the new blob subconfiguration 
in the blob-pebble game, but the lemma provides subconfigurations that will be used as 
building blocks in the derivations of below. 

Now we are ready for the case analysis over the vertex r for the downloaded axiom clause 
G G Ax d (r). Recall that the assumption is that there exists a blob subconfiguration (H 7 ) G 
S(Ct) \ S(Ct_i) induced through for C B C Q_i and S C G \ B with W = S n lpp(B). 
Remember also that we want to explain all new subconfigurations in S(C$) \ S(Q_i) in terms of 
pebbling moves from S(Q) U {[r](pred(r))}. As illustrated in Figure[6j the cases for r are: 

1. r e G \ (G b A U |J qJin(B)) for 6 = bot(B), 

2. r€|J?Pin(-B)\5, 

3. r£B\{b} for 6 = bot(S), 

4. r = bot(-B), and 

5. reG^ for 6 = bot(B). 

7.5.1 Case 1 : r g G \ (G b A u U ^Pin(-B)) for 6 = bot(B) 

If r G G \ (G^ U (J ^Pin(S) ) , this means that the vertex r is outside the set of vertices covered by 
source paths via B to top(B). In other words, r ^ lpp(B) U i? and part[3]of Lemma[7JHyields that 
[-B](W U (r n lpp(B))) = [B](W] is derivable from S(C t _i) by inflation. Note that we need 
no intermediate subconfigurations in this case. 

7.5.2 Case 2: r G U 5Pta(-B) \ 5 

This is the first more challenging case, and we do it in some detail to show how the reasoning goes. 
The proofs for the rest of the cases are analogous and will be presented in slightly more condensed 
form. 

The condition r G U ty m {B) \ B says that the vertex r is located on some path from bot(-B) 
via B to top(-B) strictly above the bottom vertex b = bot(-B). In particular, this means that 
r cannot be a source vertex. Let pred(r) = {p,q} and denote the downloaded axiom clause 

C = p i yq j y\J d l= in. 

Part[3]of LemmaUJUsays that we can derive the blob subconfiguration 

[B](W U (r n lpp(B))) = [B](W U r) (7.11) 
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Figure 6: Cases for vertex r with respect to new black blob B at download of axiom C e Ax d (r). 

by inflation from §(Q_i), where the equality holds since r G (J ty m (B) \ B C lpp(B) by Defini- 
tion !6.7l Also, since r is on some path above b, at least one of the predecessors of r must be located 
on some path from b as well. That is, translating what was just said into our notation we have that 
the fact that r G [j ^ m (B) n G^ implies that either p G U Vin{B) or q G (J or both. By 

symmetry, we get two cases: p G |J *P; n (-B), q ^ (J ^Pi n (i?) and {p, C |J <P; n (£?). Let us look 
at them in order. 

I. p G U*Pi n (i?), g ^ |J*Pi n (i?): We make a subcase analysis depending on whether p G 
B U W or not. Recall from part [2] of Lemma I7~8l that p ^ B. The two remaining cases are 
p G W and p ^ 5 U W. 

(a) p G W: Let i; be the uppermost vertex in B below p, or in formal notation 

v = top(Gl n B) . (7.12) 

Such a vertex v must exist since p G (J*Pi n (i?) \ -B. Since p is above f and is a 
predecessor of r, it lies on some path from v to r, i.e., p G U <Pi n ({-u, r}) \ {v, r}. For 
the sibling q we have g ^ U^m({ u i r })- This is so since q ^ |J and for any 

path P G *Pi n ({w,r}) it holds that P C [j^} in (B) since there is nothing inbetween v 

and r in B, i.e., (U^inO,r}) \ {v,r}) n -B = 0. Also, q £ g\ ^ g\ because of 
the Sibling non-reachability property [6.21 Hence, it must hold that q ^ lpp({v, r}). 
We can use this information to make blob-pebbling moves resulting in [B] (W) as fol- 
lows. First introduce [r] (p, q) and inflate this subconfiguration to 

[v,r]({p,q} n lpp({v,r})) = [v,r](p) . (7.13) 

Then derive the subconfiguration [.B](W U r) in (17.111 ) by inflation from §(Q_i). 
Finally, merge the two subconfigurations (17.111) and (17.131) . The result of this merger 
move is [B U v](W U p) = [J3](W). 
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(b) p £ B U W: Note that p G ^m{B) \ B by assumption. Also, it must hold that p g S 
since otherwise we would get the contradiction p G 5 n (ty- m (B) \B) C Sn lpp(B) = 
W. Thus, p G lpp(B) \ S and part|4]of Lemma 17781 yields that we can derive the blob 
subconriguration 

[B U p](Wp) for W P QW (7.14) 

by inflation from S(Q_i), where W p = S D lpp(B U p) C 5 n Zpp(B) = W since 
lpp(B Up) C lpp{B) if p G |J ^Pi n (-B). (This last claim is easily verified directly from 
Definition O) 

With v = top(G^ n B) as in (17.121 ). introduce [r](p, q) and inflate to [v,r](p) as in 
(17- 13b - Merging the subconfigurations (17.131 ) and (17.141 ) yields 

[B U {v,r}]{W p ) = [B U r](W p ) (7.15) 

and a second merger of the resulting subconriguration (17.151 ) with the subconriguration 
in (177111 produces [B}(W U W p > = [B](W). 

This finishes the case p G U *p in (-B), <? g U Vto(-B)- 

{P, Q U ^Pin(-B): By part |2] of Lemma |T8]{p, q} n 5 = 0, so {p, g} C ^(B) \ £. By 
symmetry, we have the following subcases for p and q with respect to membership in B and 
W. 



(a) {p, g} C W, 

(b) p G W, q i W, 

(c) {p, q} n (B U W) = 0. 



[B U q](W q ) for W 9 C W 



(7.16) 



We analyze these subcases one by one. 

(a) {p, q} C W: This is easy. Just introduce [r] (p, g) and merge this subconriguration with 
the subconriguration (17.111 ) to get [B](W U {p, q}} = [B](W). 

(b) p £ W, q W: In this case it must hold that q ^ S since otherwise we would have 
q G S n OPin(-B) \ S) C S fl /pp(-B) = W contradicting the assumption. Thus 
q G OPin(B) \ B) \ S C Zpp(B) \ S and part|4]of LemmaO allows us to derive 



by inflation from S(Q_i). Here we have W q = S D lpp(B U q) C 5 n Zpp(-B) = W 
since lpp(B U g) C lpp(B) when g G |J 

Introduce [r] (p, g) and merge with the subconriguration (17.161 ) to get 

[BUr](W q Up) (7.17) 
and then merge (17.171 ) with [.B](W U r) from (17.111 ) to get [.B](W U W q U p) = 

(c) {p,?} n 8 U If = fl: Just as for the vertex g in case case lllbl here it holds for 
both p and q that {p, g} C lpp{B) \ S. Part |4] of Lemma 17781 yields subconfigurations 
[B U p\{W p ) for Wp C W as in (17141 and [B U g](W„) for lf,Clfas in (ITTtjl) 
derived by inflation from S(Q_i). 
Introduce [r] (p, q) and merge with (17.141 ) on p to get 

[jB U r](W p U ?) (7.18) 
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and then merge (17.181) with (17.161 ) on q resulting in 

[B U r]{W p U W q ) . (7.19) 
Finally, merge (17191 with (ITTTb on r to get [B]{W I) W p U W q ) = [B](W). 

This concludes the case r € U*Pi n (i?) \ -B. We can see that in all subcases, the new blob 
subconfiguration is derivable from S(Q_i) U [r](pred(r)) by inflation moves followed by 

mergers on some subset of {p, q, r}. 

Let us analyze the cost of deriving We want to bound the cost of the intermedi- 

ate subconfigurations that are used in the transition from to §(Q) but are not present 

in S(Q). We first note that for the subconfigurations [B](W U r), [B U p](W p ), [B U q](W q ) 
and [B U r] (W) for various W' C W , the chargeable vertices are all subsets of the chargeable 
vertices of the final subconfiguration [S](Vt / ). This is so since b = bot(-B) is the bottom vertex in 
all these black blobs, and all chargeable white vertices are contained in W n G\. The subconfigu- 
rations [r](p, q) and [v, r]{p) for v = top(G^ n B) can incur an extra cost, however, but this cost 
is clearly bounded by | {p, q, r,v}\ = 4. 

7.5.3 Case 3: r e B \ {b} for b = bot(B) 

First we note that in this case, we can no longer use part [3] of Lemma 17.81 to derive the blob sub- 
configuration U r) of (17- lib - The vertex r cannot be added to the support S since it is 
contained in B. Also, we note that r cannot be a source since it is above the bottom vertex b. As 
usual, let us write pred(r) = {p, q}. 

Observe that just as in case 2 (Section 17.5.21) we must have either p G [j^i n (B) or q 6 
U ty m (B) or both. By symmetry we get the same two cases for membership of p and q in \J V($ m (B), 
namely p £ [j ^ m (B), q(£\J % n (B) and {p, q} C |J qj ta (B). 

I. p € U*Pin(£), Q £ U^Pin(-B): As before, p £ B by part of Lemma El We make a 
subcase analysis depending on whether p £ W or p ^ B U W. 

As in (17.121 ) we let v = top(G^ n B) and note that p € |J ^Pi n ({w, r}) \ {v, r}. For g we 
have q ^ U^Pin({ u > r }) since q ^ U*P; n (i3) but {v,r} C |J<P; n (£?) and there is nothing 
inbetween v and r in B. Also, g ^ 5 because of the Sibling non-reachability 
property [6.21 Hence, it holds that q £ lpp({v, r}). 

(a) p G VK: Introduce [r](p, g), inflate [r](p, q) to [i>,r]({p, g} n lpp({v,r})) = [v,r](p) 
as in (17.131 ) and continue the inflation to [B U {u, r}](PF Up) = 

(b) p ^ 1? U M^: Just as in case 2, p ^ W implies p <^ S, so p £ lpp(B) \ S and we can 
use part 0] of Lemma 17781 to derive [B U p](W p ) for W p C as in (17.141) . Introduce 
[r](p, g), inflate to [v, r](p) as in (17.131) and merge (17.131) and (17.141 ) on p resulting in 
[B U {v,r}](W p } = [B] (W p ), which can be inflated to [B](W). 

II. {p, q} C |J *Pi n (5): We have the same possibilities to consider for containment of p and q 
in B U W as in case 2(HI]) on pagel36l 

(a) {p, q} C VF: This is immediate. Introduce the subconfiguration [r](p, q) and inflate to 

[B U r](W U {p,q}) = [B](W). 

(b) p £W, q (£ B U W: Apply part H of Lemma |T8] to derive [B U q](W q ) for W q CW 
by inflation from §(Ci_i). Then introduce [r](p, q) and merge on q to get the sub- 
configuration [B U r](W q Up) = [5](W g U p), which can be inflated further to 

[B){W q U p U W) = [B](W). 
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(c) {p, q] n (B U W) = 0: In the same way as in case lllbl derive the subconfigurations 
[B U p](W p ) and [B U q](W q ) with W p UW q QW from S(Q_i) by inflation. Intro- 
duce [r](p, q) and merge twice, first on p and then on q, to get [B](W P U W q ), which 
can be inflated to [B] (W). 

This concludes the case r G 5 \ {6}. We see that in all subcases the new blob subconfiguration 
[B] (W) is derivable from S(Q_i) U [r] (pred(r)} by inflation moves followed by mergers on some 
subset of {p, q}, possibly followed by one more inflation move. 

As in the previous case, the bottom vertex in all of the black blobs [B U p], [B U q] and 
[B U r] is b = bot(B), and the corresponding chargeable white pebbles are subsets of those of W. 
The extra cost caused by the subconfigurations [r] (p, q) and [v, r] (p) is at most 4. 

7.5.4 Case 4: r = bot(B) 

If r is a source, any [B](W) with r £ B can be derived by introducing [r](pred(r)) = [r](0) and 
inflating. Suppose therefore that r = bot(B) is not a source and let pred(r) = {p, q}. Then it 
holds that {p, q} C C lpp(B), i.e., the vertex sets B U p and B U g are both chains. 

By symmetry, we have three cases for p and g with respect to membership in W. (It is still true 
that {p, g} n B = by part|2]of Lemma|T8]) 

(a) {p, q} C VK: Immediate. Introduce [r](p,q) and inflate to [B U r](W U {p, g}) = 

(b) p G W, q £ W: Enlist the help of our old friend Lemma 17^81 partHl to derive [B U q](W q ) 
for W q OWby inflation from S(Q_i) (where C W holds since /pp(S U v) C /pp(5) 
if f G G^). Introduce [r](p, </) and merge with [5 U q](W q ) to get [i? U r](M / (? Up) = 
[B](W q U p). Then inflate [B](W q U p) to [B](W q U p U W) = [B]{W). 

(c) {p, g} n = 0: Following an established tradition, mimic case|b]and derive [B U p](W p } 
and [B U q](W q ) with 1^, U W, C W by inflation from S(C<_i). Introduce [r](p,q), do 
two mergers to get [B] (W p U W q ) and inflate to [B] (W). 

This takes care of the case r = b. Again, in all subcases our new subconfiguration [B](M / ) is 
derivable from S(Ct_i) U [r](pred(r)} by inflation moves followed by mergers on some subset of 
{p, q}, possibly followed by one more inflation move. 

This time the blobs [B U p] and [B U q] can cause an extra intermediate cost of 1 each for 
the bottom vertices p and q, and [r] (p, q) potentially adds an extra cost 1 for r, giving that the 
intermediate extra cost is bounded by 3. 

7.5.5 Case 5: r e G k A for b = bot(B) 

This final case is very similar to the previous case r = hot(B). Note first that r G g\ C lpp(B). 
If r is a source, then C = Vj=i r « an ^ we nave 

C B U {C} U B(5) = C b U 1(5 U r) > ^H+(B) (7.20) 

at time t — 1, which shows that [B](W U r) G S(Ct_i). Hence, we can introduce [r](pned(r)) = 
[r] (0) and merge on r to get [B] (W). 

As usual, the more interesting case is when r is a non-source with pred(r) = {p, q}. The case 
analysis is just as in case 4 (Section [7.5.4b . However, note that now we can again use part [3] of 
Lemma 17781 to derive [B](W U r) from S(Ct_i) by inflation since it holds that r ^ B. 

(a) {p, q} C W: Introducing [r](p,q) and merging with [B](TV U r) yields [B](W). 
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(b) p G W, q W: Appeal to partl4lof LemmarT8lto get [B U q]{W q ) for W q C W by inflation 
from S(Ct-i). Introduce [r](p, q) and merge to get [B U r](W g U p), and merge again with 
[B](W U r) to get [B](W>. 

(c) {p, n = 0: As in case [b] above for g, derive [B U p](Wp) and [B U g](W g ) with 
W p U W q C W by inflation from §(Cf_i). Introduce [r](p, g) and do two mergers to get 
[B U r](Wp U W q ). Finally merge [J3 U r)(W p U W 9 ) with [B](W U r) to get [B](W). 

This takes care of the case r = g\. We note that in all subcases of this case, [B] (W) is derivable 
from S(Ct-i) U [r](pred(r)) by inflation moves followed by mergers on some subset of {p, q, r}. 
Again, the extra intermediate pebbling cost is bounded by \{p, q,r}\ = 3. 

7.6 Wrapping up the Proof 

If 7r = {Co, . . . , C T } is a derivation of Vf=i z i from *Pebfj, it is easily verified from Definition |T2] 

that S(Co) = S(0) = and S(C T ) = S({Vti *}) = {MWI- 

In Sections [7.31 17.41 and 17.51 we have shown how to do the intermediate blob-pebbling moves 
to get from §(Cf_i) to S(Ct) in the case of erasure, inference and axiom download, respectively. 
For erasure and inference, the blob-pebbling cost changes monotonically during the transition 
§(Cf_i) ~-> S(Q). In the case of axiom download, there can be an extra cost of 4 incurred for 
deriving each [B](W) G S(Q) \ §(Q_i). We have no a priori upper bound on |S(Q) \ S(Q_i)|, 
but if we just derive the new subconfigurations one by one and erase all intermediate subconfigu- 
rations inbetween these derivations, we will keep the total extra cost below 4. 

This shows that the complete blob-pebbling of G associated to a resolution derivation 
7r : *PebQ h Vi=i z i by the construction in this section has blob-pebbling cost bounded from above 
by COSt(V w ) < maxc£7r{cosf(S(C))} + 4. Theorem 1731 is thereby proven. 

8 Induced Blob Configurations Measure Clause Set Size 

In this section we prove that if a set of clauses C induces a blob-pebbling configuration 8(C) 
according to Definition 17.21 then the cost of 8(C) as specified in Definition l6.9l is at most |C|. That 
is, the cost of an induced blob-pebbling configuration provides a lower bound on the size of the set 
of clauses inducing it. This is Theorem 18.51 below. 

Note that we cannot expect a proof of this fact to work regardless of the pebbling degree d. The 
induced blob-pebbling in Section |7J makes no assumptions about d, but for first-degree pebbling 
contradictions we know that Sp(*Pebl> h z-y) = Sp{Peb l G h 0) = 0(1). Provided d > 2, though, 
we show that one has to pay at least |C| > N clauses to get an induced blob-pebbling configuration 
of cost N. 

We introduce some notation to simply the proofs in what follows. Let us define Vars d (u) = 
{u\, . . . , Ud}. We say that a vertex u is represented in a clause C derived from *PebQ, or that C 
mentions u, if Vars d (u) Pi Vars(C) ^ 0. We write 

V(C) = {u G V(G)\ Vars d {u) n Vars{C) / 0} (8.1) 

to denote all vertices represented in C. We will also refer to V(C) as the set of vertices mentioned 
by C. This notation is extended to sets of clauses by taking unions. Furthermore, we write 

C\U\ = {C g C I V(C) n U + 0} (8.2) 

to denote the subset of all clauses in C mentioning vertices in a vertex set U. 
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We now show some technical results about CNF formulas that will come in handy in the proof 
of Theorem l8.5l Intuitively, we will use Lemma [8TT1 below together with Lemma 7.4 on page 32 to 
argue that if a clause set C induces a lot of subconngurations, then there must be a lot of variable 
occurrences in C for variables corresponding to these vertices. Note, however, that this alone will 
not be enough, since this will be true also for pebbling degree d = 1. 

Lemma 8.1. Suppose for a set of clauses C and clauses D\ and D 2 with Vars(Di) n Vars(D2) = 
that C 1= Di V D 2 but C ¥■ D 2 . Then there is a literal a £ Lit(C) D Lit{Dx). 

Proof. Pick a truth value assignment a such that a(C) = 1 but 01(1)2) = 0. Since C N D, we must 
have ct{D\) = 1. Let a' be the same assignment except that all satisfied literals in D\ are flipped to 
false (which is possible since they are all strictly distinct by assumption). Then a'{D\ V D 2 ) = 
forces a'(C) = 0, so the flip must have falsified some previously satisfied clause in C. □ 

The fact that a minimally unsatisfiable CNF formula must have more clauses than variables 
seems to have been proven independently a number of times (see, for instance, [Q] |20l 38 ]). 
We will need the following formulation of this result, relating subsets of variables in a minimally 
implicating CNF formula and the clauses containing variables from these subsets. 

Theorem 8.2. Suppose that F is CNF formula that implies a clause D minimally. For any subset 
of variables V of F, let Fy = {C £ F \ Vars(C) n V 7^ 0} denote the set of clauses containing 
variables from V. Then ifV C Vars(F) \ Vars(D), it holds that \Fy \ > \V\. In particular, if F 
is a minimally unsatisfiable CNF formula, we have \Fy\ > \ V\for all V C Vars(F). 

Proof. The proof is by induction over V C Vars(F) \ Vars(D). 

The base case is easy. If \V \ = 1, then \Fy\ > 2, since any x € V must occur both unnegated 
and negated in F by Lemma UA\ 

The inductive step just generalizes the proof of Lemma l7~4l Suppose that \Fy\ > \V'\ for 
all strict subsets V g V C Vars(F) \ Vars(D) and consider V. Since F v > C Fy if V C V, 
choosing any V of size \V\ — 1 we see that \Fy\ > \Fy \ > \V'\ + 1 = \V\. 

If \Fy\ > \ V\ there is nothing to prove, so assume that \Fy\ = \ V\. Consider the bipartite graph 
with the variables V and the clauses in Fy as vertices, and edges between variables and clauses for 
all variable occurrences. Since for all V' C V the set of neighbours N(V') = Fy C Fy satisfies 
— l^'l' by Hall's marriage theorem there is a perfect matching between V and Fy. Use 
this matching to satisfy Fy assigning values to variables in V only. 

The clauses in F' = F \ Fy are not affected by this partial truth value assignment, since they 
do not contain any occurrences of variables in V. Furthermore, by the minimality of F it must 
hold that F' can be satisfied and D falsified simultaneously by assigning values to variables in 
Vars(F') \ V. 

The two partial truth value assignments above can be combined to an assignment that satisfies 
all of F but falsifies D, which is a contradiction. Thus \Fy\ > \V\. The theorem follows by 
induction. □ 

Continuing our intuitive argument, given that Lemmas lT!4l and [8Tl tell us that many induced sub- 
configurations implies the presence of many variables in C, we will use Theorem l8.2l to demonstrate 
that a lot of different variable occurrences will have to translate into a lot of different clauses 
provided that the pebbling degree d is at least 2. Before we prove this formally, let us try to 
provide some intuition for why it should be true by studying two special cases. Recall the notation 
B(V) = {\/ im Vi\veV} and All + {V) = \J veV \f ie[d[ v t from Section[7] 

Example 8.3. Suppose that C is a clause set derived from *Peb d G that induces N independent black 
blobs Bi, . . . , Bn that are pairwise disjoint, i.e., Bi n Bj = if i / j. Then the implications 

C t= All + (Bi) (8.3) 
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hold for i = 1, . . . , N. Remember that since *PebQ is non-contradictory, so is C. 

It is clear that a non-contradictory clause set C satisfying (18.31 ) for i = 1, . . . , N is quite simply 
the set 

C = {AU + (Bi)\i = 1, . . . N} (8.4) 

consisting precisely of the clauses implied. Also, it seems plausible that this is the best one can 
do. Informally, if there would be strictly fewer clauses than N, some clause would have to mix 
variables from different blobs Bi and Bj. But then Lemma 17741 says that there will be extra clauses 
needed to "neutralize" the literals from Bj in the implication C 1= All + (Bi) and vice versa, so that 
the total number of clauses would have to be strictly greater than N. 

As it turns out, the proof that |C| > N when C induces N pairwise disjoint and independent 
black blobs is very easy. Suppose on the contrary that (18.31 ) holds for i = 1,...,N but that 
|C| < N. Let a be a satisfying assignment for C. Choose a' C a to be any minimal partial truth 
value assignment fixing C to true. Then for the size of the domain of a' we have |Dom(a')| < N, 
since at most one distinct literal is needed for every clause C £ C to fix it to true. This means 
that there is some Bi such that a' does not set any variables in Vars d (Bi). Consequently a' can be 
extended to an assignment a" setting C to true but All + {Bi) to false, which is a contradiction. With 
some more work, and using Theorem 18.21 one can show that |C| > N if variables from distinct 
blobs are mixed. 

Note that the above argument works for any pebbling degree including d = 1. Intuitively, this 
means that one can charge for black blobs even in the case of first degree pebbling formulas. 

Example 8.4. Suppose that the clause set C induces an blob subconfiguration [B] (W) with W 7^ 0, 
and let us assume for simplicity that C is minimal and W = S so that the implication 

C U B(W) t= All + {B) (8.5) 

holds and is minimal. We claim that |C| > \W\ + 1 provided that d > 1. 

Since by definition B n W = we have Vars(AU + (B)) n Vars(B(W)) = 0, and Theorem^ 
yields that |C U M(W)\ > |C[W] U M(W)\ > \ Vars(M(W))\, using the notation from dO). This 
is not quite what we want — we have a lower bound on |C U B(W)|, but what we need is a bound 
on |C|. But if we observe that | Vars(M(W))\ = d\W\ while |B(W)| = \ W\, we get that 

|C| > I Vars(B(W))\ - \U{W)\ + 1 = (d - 1)\W\ + 1 > \W\ + 1 (8.6) 

as claimed. 

We remark that this time we had to use that d > 1 in order to get a lower bound on the clause 
set size. And indeed, it is not hard to see that a single clause on the form C = v\\/ VmeW ^ 1 can 
induce an arbitrary number of white pebbles if d = 1. Intuitively, white pebbles can be had for free 
in first degree pebbling formulas. 

In general, matters are more complicated than in Examples 18.31 and 18.41 If [i?i](W r i) and 
[i?2](W / 2) are two induced blob subconfigurations, the black blobs B\ and B2 need not be dis- 
joint, the supporting white pebbles W\ and W2 might also intersect, and the black blob B\ can 
intersect the supporting white pebbles W2 of the other blob. Nevertheless, if we choose with some 
care which vertices to charge for, the intuition provided by our examples can still be used to prove 
the following theorem. 

Theorem 8.5. Suppose that G is a blob-pebblable DAG and let C be a set of clauses derived from 
the pebbling formula *Peb%ford> 2. Then |C| > COSf(S(C)). 

Proof. Suppose that the induced set of blob subconfigurations is S(C) = { [Bi](Wi) \i € [m]}. By 
Definition 15791 we have COSt{S(C)) = \B U W A | where 

B = {bot(Bi)\[Bi]{Wi) G S(C)} (8.7) 
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and 

W A = |J (w t n G^ ot(Bi) ) . (8.8) 
[Bi]{Wi>e§(C) 

We need to prove that |C| > \B U W A |. 

We first show that all vertices in B U W A are represented in some clause in C. By Definition l7.2[ 
for each [Bj](Wi) G 8(C) there is a clause set Q C C and a vertex set Si C G \ Bi with 
Wj = Si n lpp{Bi) C S'j such that 

Ci U B(Si) N 4H+(£j) (8.9) 

and such that this implication does not hold for any strict subset of Q, Si or l?j. Fix (arbitrarily) 
such Cf and Si for every [i?j](Wj) G S(C) for the rest of this proof. 

For the induced black blobs Bi we claim that Bi C V(Q), which certainly implies bot(-Bj) G 
V(C). To establish this claim, note that for any v G Bi we can apply Lemma |8~T1 with L>i = 
Vj=i v j an d ^2 = AU + (Bi \ {v}) on the implication (18.91 ). which yields that the vertex v must be 
represented in Cj U B(Wj) by some positive literal vj. Since Bi n Si = 0, we have Vars(B(Sj)) n 
Vars(All + (Bi)) = and thus vj G Lit(d). 

Also, we claim that S$ C V(Q). To see this, note that since B 8 n 5; = and the implica- 
tion (18.91 ) is minimal, it follows from Lemma lT4l that for every w G Si, all literals wj, j G [d], must 
be present in Q. Thus, in particular, it holds that Wj Pi (7^ ot ( Si ) c V(Q). 

We now prove by induction over subsets R C £> U W A that |C Ji2j | > |JS|. The theorem clearly 
follows from this since |C| > |C[i?] |. (The reader can think of R as the set of vertices representing 
the blob-pebbling configurations [£>,;] (Wj) G S(C) in the clause set C.) 

The base case \R\ = 1 is immediate, since we just demonstrated that all vertices r G R are 
represented in C. 

For the induction step, suppose that |C[i?'J| > \R'\ for all R' ^ R. Pick a "topmost" vertex 
r G R, i.e., such that G? n R = 0. We associate a blob subconfiguration \Bi]{Wi) G S(C) with 
r as follows. If r = bot(-Bj) for some [Bi](Wi), fix [Bi](Wi) arbitrarily to such a subconfigu- 
ration. Otherwise, there must exist some [Bj](Wj) such that r G W« n G^ ot ^ ! \ so fix any such 
subconfiguration. We note that it holds that 

R n G% om) C {r} (8.10) 

for [Bj](Wj) chosen in this way. 

Consider the clause set Q C C and vertex set Si 5 Wj from (18.91 ) associated with [Bj](Wj) 
above. Clearly, by construction r G V"(Q) is one of the vertices of R mentioned by Q. We claim 
that the total number of vertices in R mentioned by Q is upper-bounded by the number of clauses 
in Cj mentioning these vertices, i.e., that 

|Qp?j| > \R n V(Q)| . (8.11) 

Let us first see that this claim is sufficient to prove the theorem. To this end, let 

R[i] =RH V(Ci) (8.12) 

denote the set of all vertices in R mentioned by Q and assume that |Q[i2]| = > \R[i}\- 

Observe that Cjji?[f]]] C C[i?], since Q C C and R[i] C R. Or in words: the set of clauses in 
Cj mentioning vertices in R[i] is certainly a subset of all clauses in C mentioning any vertex in R. 
Also, by construction Cj does not mention any vertices in R \ R[i] since R[i] = R n V(Cj). That 
is, 

C[i2\i2[i]]CC[i2]\Q (8.13) 
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in our notation. Combining the (yet unproven) claim (18.111 ) for = Q[[i2[i]]] asserting that 

ICid-R^J I > \R[i]\ with the induction hypothesis for R \ R[i] C R \ {r} g R we get 



\C{R\\ = \Ci\R^(C\{ 

> \CilRnV(Ci)luClR\V(Ci)}\ 

= \CilR\i]}\ + \ClR\R[{\}\ (8.14) 
>\R[i]\ + \R\R\i}\ 

= \R\ 

and the theorem follows by induction. 

It remains to verify the claim (fHTTTb that |Ci[i2[i]]| > \R[i]\ for R[i] = R n V{d) ^ 0. To do 
so, recall first that r G R[i\. Thus, / and if R[i] = {r} we trivially have > 1 = 

\R[i]\- Suppose therefore that R[i] 2 {r}. 

We want to apply Theorem 18.21 on the formula F = Cj U B(Sj) on the left-hand side of the 
minimal implication (l8~79l Let R' = R[i] \ {r}, write R' = Rx U i? 2 for R\ = R' n 5, and 
R2 = R' \ Ri, and consider the subformula 

= {C G (Cj U B(5j)) I V(C) n i?' / 0} 



Qli?'] U M(R 



of F = Cj U B(5j). A key observation for the concluding part of the argument is that by (18.101 ) 
we have Vars d {R') n Vars(All+(Bi)) = 0. 

For each w G R\, the clauses in B(i?i) contain d literals w\, . . . , and these literals must 
all occur negated in Cj by Lemma [7741 For each u G R2, the clauses in Cj[i?'] contain at least 
one variable Ui. Appealing to Theorem l8.2l with the subset of variables Vars d (R') D Vars(Ci) C 
Vars(F) \ Vars(AU + (Bi)), we get 

\F R ,\ = |Qp?'] U B(fl0| 

> I Vars d (R') n Vars(Q)| + 1 (8.16) 

> + |i? 2 | + 1 , 

and rewriting this as 

\CilR[i\}\ > \QiR'}\ 

= \F R A - \M(Ri)\ 

M i. 1 (8-17) 

> (d-l)|fli| + \R 2 \ + 1 

> \m\ 

establishes the claim. □ 



We have two concluding remarks. Firstly, we note that the place where the condition d > 2 is 
needed is the very final step (18- 17b - This is where an attempted lower bound proof for first degree 
pebbling formulas *PebQ would fail for the reason that the presence of many white pebbles in S(C) 
says absolutely nothing about the size of the clause set C inducing these pebbles. Secondly, another 
crucial step in the proof is that we can choose our representative vertices r G R so that (18.101 ) holds. 
It is thanks to this fact that the inequalities in (18.161 ) go through. The way we make sure that (18.101 ) 
holds is to charge only for (distinct) bottom vertices in the black blobs, and only for supporting 
white pebbles below these bottom vertices. 



43 



TOWARDS AN OPTIMAL SEPARATION 



9 Black-White Pebbling and Layered Graphs 

Having come this far in the paper, we know that resolution derivations induce blob-pebblings. We 
also know that blob-pebbling cost gives a lower bound on clause set size and hence on the space of 
the derivation. The final component needed to make the proof of Theorem 11.11 complete is to show 
lower bounds on the blob-pebbling price Blob-Peb(Gi) for some nice family of blob-pebblable 
DAGs Gi. 

Perhaps the first idea that comes to mind is to try to establish lower bounds on blob-pebbling 
price by reducing this problem to the problem of proving lower bounds for the standard black- 
white pebble game of Definition 13.41 This is what is done in [42] for the restricted case of trees. 
There, for the pebblings V n that one gets from resolution derivations tt : *Peb^ h Vf=i z i m a ratner 
different so-called "labelled" pebble game, an explicit procedure is presented to transform V w into a 
complete black-white pebblings of T in asymptotically the same cost. The lower bound on pebbling 
price in the labelled pebbel game then follows immediately by using the known lower bound for 
black- white pebbling of trees in Theorem 14. 81 

Unfortunately, the blob-pebble game seems more difficult than the game in ll42l to analyze in 
terms of the standard black-white pebble game. The problem is the inflation rule (in combination 
with the cost function). It is not hard to show that without inflation, the blob-pebble game is 
essentially just a disguised form of black-white pebbling. Thus, if we could convert any blob- 
pebbling into an equivalent pebbling not using inflation moves without increasing the cost by more 
than, say, some constant factor, we would be done. But in contrast to the case for the labelled 
pebble game in 11421 played on binary trees, we are currently not able to transform blob-pebblings 
into black-white pebblings in a cost-preserving way. 

Instead, what we do is to prove lower bounds directly for the blob-pebble game. This is not 
immediately clear how to do, since the lower bound proofs for black-white pebbling price in, for 
instance, ll24l I3T1 l37l l39l all break down for the more general blob-pebble game. We are currently 
able to obtain lower bounds only for the limited class of layered spreading graphs (to be defined 
below), a class that includes binary trees and pyramid graphs. In our proof, we borrow heavily from 
the corresponding bound for black- white pebbling in lf37ll . but we need to go quite deep into the 
construction in order to make the changes necessary for the proof go through in the blob-pebbling 
case. In this section, we therefore give a detailed exposition of the lower bound in 071 . in the 
process simplifying the proof somewhat. In the next section we build on this result to generalize 
the bound from the black- white pebble game to the blob-pebble game in Definition 16. 81 

9.1 Some Preliminaries and a Tight Bound for Black Pebbling 

Unless otherwise stated, in the following G denotes a layered DAG; u, v, w, x, y denote vertices 
of G; U, V, W, X, Y denote sets of vertices; P denotes a path; and <p denotes a set of paths. We 
will also use the following notation. 

Definition 9.1 (Layered DAG notation). For a vertex u in a layered DAG G we let level (u) 
denote the level of u. For a vertex set U we let minlevel(C/) = min{level(u) : u G U} and 
maxlevel(C/) = max{level(u) :«£[/} denote the lowest and highest level, respectively, of any 
vertex in U. Vertices in U on particular levels are denoted as follows: 

• U{>: j} = {u £ U | level(w) > j} denotes the subset of all vertices in U on level j or higher. 

• U{>~j} = {u 6 U | level (u) > j} denotes the vertices in U strictly above level j. 

• U{~j} = U{>zj} \ U{>- j} denotes the vertices exactly on level j. 
The vertex sets U{-<j} and U{~<j} are defined wholly analogously. 
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1 2 3 4 ••• h h+1 
(b) Pyramid as fragment of 2D rectilinear lattice. 



(a) Pyramid graph of height h = 6. 

Figure 7: The pyramid n 6 of height 6 with labelled vertices 



For the layered DAGs G under consideration we will assume that all sources are on level 0, 
that all non-sources have indegree 2, and that there is a a unique sink z. Since all layered DAGs 
also possess the Sibling non-reachability property 16.21 this means that we are considering blob- 
pebblable DAGs (Definition l6.6l ). and so the blob-pebble game can be played on them. 

Although most of what will be said in what follows holds for arbitrary layered DAGs, we will 
focus on pyramids since these are the graphs that we are most interested in. Figure [7(a)1 presents a 
pyramid graph with labelled vertices that we will use as a running example. Pyramid graphs can 
also be visualized as triangular fragments of a directed two-dimensional rectilinear lattice. Perhaps 
this can sometimes make it easier for the reader to see that "obvious" statements about properties of 
pyramids in some of the proofs below are indeed obvious. In Figure [7(b)| the pyramid in Figure [7(a)1 
is redrawn as such a lattice fragment. 

In the standard black and black-white pebble games, we have the following upper bounds on 
pebbling price of layered DAGs. 

Lemma 9.2. For any layered DAG Gh of height h with a unique sink z and all non-sources having 
vertex indegree 2, it holds that Peb(G h ) <h + 0(1) and BW-Peb(G h ) < h/2 + 0(1). 



Proof. The bounds above are true for complete binary trees of height h according to Theorem 14. 81 
It is not hard to see that the corresponding pebbling strategies can be used to pebble any layered 
graph of the same height with at most the same amount of pebbles. 

Formally, suppose that the sink z of the DAG Gh has predecessors x and y. Label the root of 
Th by z\ and its predecessors by x\ and y\. Recursively, for a vertex in Th labelled by Wi, look 
at the corresponding vertex w in Gh and suppose that pred(w) = {u, v}. Then label the vertices 
pred(wi) in Th by Uj and Vk for the smallest positive indices j, k such that there are not already 
other vertices in Th labelled Uj and Vk- In Figure [8] there is an illustration of how the vertices in a 
pyramid 1I3 of height 3 are mapped to vertices in the complete binary tree T3 in this manner. 

The result is a labelling of Th where every vertex v in Gh corresponds to one or more distinct 
vertices V\,... ,Vk v in Th, and such that if pred(wi) = {uj,Vk} in Th, then pred(w) = {u, v} 
in Gh- Given a pebbling strategy V for Th, we can pebble Gh with at most the same amount of 
pebbles by mimicking any move on any in Th by performing the same move on v in Gh- The 
details are easily verified. □ 



45 



TOWARDS AN OPTIMAL SEPARATION 




(a) Pyramid graph n 3 of height 3. (b) Binary tree T 3 with vertex labels from n 3 . 



Figure 8: Binary tree with vertices labelled by pyramid graph vertices as in proof of Lemma l9?2l 

In this section, we will identify some layered graphs Gh for which the bound in Lemma |9T2~1 is 
also the asymptotically correct lower bound. As a warm-up, and also to introduce some important 
ideas, let us consider the black pebbling price of the pyramid 11^ of height h. 

Theorem 9.3 (1122V). Peb(TL h ) = h + 2 for h>l. 

To prove this lower bound, it turns out that it is sufficient to study blocked paths in the pyramid. 

Definition 9.4. A vertex set U blocks a path P if U n P ^ 0. U blocks a set of paths *p if U 
blocks all P G <p. 

Proof of Theorem EOI It is easy to devise (inductively) a black pebbling strategy that uses h + 2 
pebbles (using, for instance, Lemma |9~2l ). We show that this is also a lower bound. 

Consider the first time t when all possible paths from sources to the sink are blocked by black 
pebbles. Suppose that P is (one of) the last path(s) blocked. Obviously, P is blocked by placing 
a pebble on some source vertex u. The path P contains h + 1 vertices, and for each vertex v G 
P \ {u} there is a unique path P v that coincides with P from v onwards to the sink but arrives at 
v in a straight line from a source "in the opposite direction" of that of P, i.e., via the immediate 
predecessor of v not contained in P. At time t — 1 all such paths {P v | v G P \ {u}} must already 
be blocked, and since P is still open no pebble can block two paths P v ^ P v i for v, v' G P \ {u}, 
v 7^ v' . Thus at time t there are at least h+l pebbles on Il/j. Furthermore, without loss of generality 
each pebble placement on a source vertex is followed by another pebble placement (otherwise 
perform all removals immediately following after time t before making the pebble placement at 
time t). Thus at time t + 1 there are h + 2 pebbles on 11^. □ 

We will use the idea in the proof above about a set of paths converging at different levels to 
another fixed path repeatedly, so we write it down as a separate observation. 

Observation 9.5. Suppose that u and w are vertices in TL^ on levels L u < L w and that P : u w 
is a path from u to w. Let K = L w — L v and write P = {v$ = u,v±, . . . ,vk = w}. Then 
there is a set of K paths *p = {Pi, ... , Pk} such that Pi coincides with P from Vi onwards to w 
arrives to V{ in a straight line from a source vertex via the immediate predecessor ofvi which is not 
contained in P, i.e., is distinct from Vi—\. In particular, for any i,j with l<i<j<kit holds 
that Pi n PjC Pj n P C P \ {u}. 

We will refer to the paths Pi, ... , Pk as a set of converging source paths, or just converging 
paths, for P : u ~» w. See Figure|9]for an example. 
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Figure 9: Set of converging source paths (dashed) for the path P : u 4 ~* y\ (solid). 
9.2 A Tight Bound on the Black-White Pebbling Price of Pyramids 

The rest of this section contains an exposition of Klawe 11371 . with some simplifications of the 
proofs. Much of the notation and terminology has been changed from 11371 to fit better with this 
paper in general and (in the next section) the blob-pebble game in particular. Also, it should be 
noted that we restrict all definitions to layered graphs, in contrast to Klawe who deals with a some- 
what more general class of graphs. We concentrate on layered graphs mainly to avoid unnecessary 
complications in the exposition, and since it can be proven that no graphs in 11371 can give a better 
size/pebbling price trade-off than one gets for layered graphs anyway. 

Recall from Definition 16 . 5 1 that a path via w is a path P such that w G P. We will also say that 
P visits w. The notation ^p v ia(w) is used to denote all source paths visiting w. Note that a path 
P G *p v ia(^) visiting w may continue after w, or may end in w. 

Definition 9.6 (Hiding set). A vertex set U hides a vertex w if U blocks all source paths visiting w, 
i.e., if U blocks *p v i a (u>). U hides W if U hides all w G W. If so, we say that U is a hiding set 
for W. We write \\U~\\ to denote the set of all vertices hidden by U. 

Our perspective is that we are standing at the sources of G and looking towards the sink. Then 
U hides w if we "cannot see" w from the sources since U completely hides w. When U blocks 
a path P is is possible that we can "see" the beginning of the path, but we cannot walk all of the 
path since it is blocked somewhere on the way. The reason why this terminological distinction is 
convenient will become clearer in the next section. 

Note that if U should hide w, then in particular it must block all paths ending in w. Therefore, 
when looking at minimal hiding sets we can assume without loss of generality that no vertex in U 
is on a level higher than w. 

It is an easy exercise to show that the hiding relation is transitive, i.e., that if U hides V and V 
hides W, then U hides W. 

Proposition 9.7. IfVC [ft/]] and W C [[V"}] then W C ([[/]]. 

One key concept in Klawe's paper is that of potential. The potential of P = (B, W) is intended 
to measure how "good" the configuration P is, or at least how hard it is to reach in a pebbling. 
Note that this is not captured by the cost of the current pebble configuration. For instance, the final 
configuration P r = ({z}, 0) is the best configuration conceivable, but only costs 1. At the other 
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extreme, the configuration P in a pyramid with, say, all vertices on level L white-pebbled and all 
vertices on level L + 1 black-pebbled is potentially very expensive (for low levels L), but does 
not seem very useful. Since this configuration on the one hand is quite expensive, but on the other 
hand is extremely easy to derive (just white-pebble all vertices on level L, and then black-pebble 
all vertices on level L + 1), here the cost seems like a gross overestimation of the "goodness" of P. 

Klawe's potential measure remedies this. The potential of a pebble configuration (B, W) is 
defined as the minimum measure of any set U that together with W hides B. Recall that U{>z j} 
denotes the subset of all vertices in U on level j or higher in a layered graph G. 

Definition 9.8 (Measure). The jth partial measure of the vertex set U in G is 



Definition 9.9 (Potential). We say that U is a hiding set for a black-white pebble configuration 
P = (B, W) in a layered graph G if U U W hides B. We define the potential of the pebble 
configuration to be 

pot G (P) = pot G (B, W) = miu{m G (U) : U is a hiding set for (B, W)} . 

If U is a hiding set for {B, W) with minimal measure m G (?7) among all vertex sets U' such that 
U' U W hides B, we say that U is a minimum-measure hiding set for P. 

Since the graph under consideration will almost always be clear from context, we will tend to 
omit the subindex G in measures and potentials. 

We remark that although this might not be immediately obvious, there is quite a lot of nice 
intuition why Definition 19.91 is a relevant estimation of how "good" a pebble configuration is. We 
refer the reader to Section 2 of j37l for a discussion about this. Let us just note that with this 
definition, the pebble configuration P r = ({z}, 0) has high potential, as we shall soon see, while 
the configuration with all vertices on level L white-pebbled and all vertices on level L + 1 black- 
pebbled has potential zero. 

Remark 9.10. Klawe does not use the level of a vertex u in Definitions 19. 81 and 19.91 but instead the 
black pebbling price Peb({u} , 0) of the configuration with a black pebble on it and no other pebbles 
in the DAG. For pyramids, these two concepts are equivalent, and we feel that the exposition can 
be made considerably simpler by using levels. 

Klawe proves two facts about the potentials of the pebble configurations in any black-white 
pebbling V = {Po, • • • , P T } of a pyramid graph IT^: 

1. The potential correctly estimates the goodness of the current configuration Ft by taking into 
account the whole pebbling that has led to F t . Namely, pot (Pi) < 2 • max s <t{C0St(F s )}. 

2. The final configuration P T = ({z}, 0) has high potential, namely pot({z}, 0) = h + 0(1). 

Combining these two parts, one clearly gets a lower bound on pebbling price. 

For pyramids, part[2]is not too hard to show directly. In fact, it is a useful exercise if one wants 
to get some feeling for how the potential works. Part Q] is much trickier. It is proven by induction 
over the pebbling. As it turns out, the whole induction proof hinges on the following key property. 

Property 9.11 (Limited hiding-cardinality property). We say that the black- white pebble con- 
figuration P = (B, W) in G has the Limited hiding-cardinality property, or just the LHC property 
for short, if there is a vertex set U such that 




and the measure of U is mciU) 



max,,- 



{m> G (U)}. 
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1. U is a hiding set for P, 

2. pot G (P) = m(U), 

3. 17 = B or |C/| < |B| + |W| = COSf(P). 

We say that the graph G has the Limited hiding-cardinality property if all black-white pebble con- 
figurations P = (B, W) on G have the Limited hiding-cardinality property. 

Note that requirements Q] and [2] just say that U is a vertex set that witnesses the potential of P. 
The important point here is requirement [3j which says (basically) that if we are given a hiding set 
U with minimum measure but with size exceeding the cost of the black-white pebble configura- 
tion P, then we can pick another hiding set U' which keeps the minimum measure but decreases 
the cardinality to at most COSt(F). 

Given Property 19-1 11 the induction proof for part Q] follows quite easily. The main part of the 
paper OTTi is then spent on proving that a class of DAGs including pyramids have Property 19.1 II 
Let us see what the lower bound proof looks like, assuming that Property [9.111 holds. 

Lemma 9.12 (Theorem 2.2 in [371). Let G be a layered graph possessing the LHC property and 
suppose that V = {Po = 0, Pi, ... , P T } is any unconditional black-white pebbling on G. Then it 
holds for all t = 1, . . . , r that pot G (P t ) < 2 • m&x s < t {C0St(P s )}. 

Proof. To simplify the proof, let us assume without loss of generality that no white pebble is ever 
removed from a source. If V contains such moves, we just substitute for each such white pebble 
placement on v a black pebble placement on v instead, and when the white pebble is removed we 
remove the corresponding black pebble. It is easy to check that this results in a legal pebbling V 
that has exactly the same cost. 

The proof is by induction. The base case Po = is trivial. For the induction hypothesis, 
suppose that pot(Pt) < 2 • max s < 4 {cosf(P s )} and let Ut be a vertex set as in Property |9.1 1[ i.e., 
such that U t U W t hides B t , pot(P t ) = m(U t ) and \U t \ < COSt(F t ) = \B\ + \W\. 

Consider Pt+i- We need to show that pot(Pt+i) < 2 • max s <t+i{C0St(P s )}. By the induction 
hypothesis, it is sufficient to show that 

pot(P m ) < max{pot(Pi),2 • COSt(¥ t+1 )} . (9.1) 

We also note that if Ut U Wt+i hides B t +\ we are done, since if so pot(Pt+i) < m(Ut) = pot(Pt). 
We make a case analysis depending on the type of move made to get from P t to Pt+i. 

1. Removal of black pebble: In this case, Ut U Wt+i = Ut U Wt obviously hides Bt+\ C B t 
as well, so pot(P m ) < pot(P t ). 

2. Placement of white pebble: Again, U t U W t +i D U t UW t hides B t+ \ = B t , so pot(P m ) < 

pot (Pi). 

3. Removal of white pebble: Suppose that a white pebble is removed from the vertex w, so 
Wt+i = Wt \ {w}. As noted above, without loss of generality w is not a source vertex. We 
claim that Ut U Wt+i still hides B t +i = B t , from which pot(P t+ i) < pot(P t ) follows as 
above. 

To see that the claim is true, note that pred{w) C B t U Wt by the pebbling rules, for 
otherwise we would not be able to remove the white pebble on w. If pred(w) C Wt we are 
done, since then Ut U Wt+i hides Ut U Wt and we can use the transitivity in Proposition |9.7i 
If instead there is some v G pred(w) n B t , then Ut U Wt = Ut U Wt+i U {w} hides v 
by assumption. Since w is a successor of v, and therefore on a higher level than v, we must 
have Ut U Wt \ {w} hiding v. Thus in any case Ut U Wt+i hides pred(w), so by transitivity 
U t U W t +i hides B t+1 . 



49 



TOWARDS AN OPTIMAL SEPARATION 



4. Placement of black pebble: Suppose that a black pebble is placed on v. If v is not a source, 
by the pebbling rules we again have that pred(v) CBjU Wt. In particular, B t U Wt hides v 
and by transitivity we have that XJ t U Wt+i = Ut U Wt hides B t U {v} = B t +\. 

The case when v is a source turns out to be the only interesting one. Now Ut U Wt does not 
necessarily hide B t U {v} = B t +\ any longer. An obvious fix is to try with Ut U {v} U Wt 
instead. This set clearly hides Bt+i, but it can be the case that m(Ut U {v}) > m(Ut). 
This is problematic, since we could have pot(Pt+i) = m(Ut U {v}) > m(Ut) = pot(Pt). 
And we do not know that the inequality pot(P t ) < 2 • COSt(F t ) holds, only that pot(P t ) < 
2 • max s < t {c0St(F s )}. This means that it can happen that pot(P^_|_i) > 2 • COSf (P(_i ), 
in which case the induction step fails. However, we claim that using the Limited hiding- 
cardinality property [9. lll we can prove for Ut+i = Ut U {v} that 

m(U t+ i) = m(U t U {v}) < max{m(U t ),2 ■ COSt(F t+ i)} , (9.2) 
which shows that d9.ll ) holds and the induction steps goes through. 

Namely, suppose that Ut is chosen as in Property [9. 11 l and consider Ut+i = Ut U {v}. Then 
Ut+i is a hiding set for Ft+i = (B t U {u}, Wt) and hence pot(P<+i) < m(Ut+i). For 
j > 0, it holds that U t+ i{hj} = U t {hj} and thus m j (U t+ i) = m j (U t ). On the bottom 
level, using that the inequality \Ut\ < COSt(F t ) holds by the LHC property, we have 

m°(U t+l ) = 2 ■ |tT t+1 | = 2 • (\U t \ + 1) < 2 • (cost (¥ t ) + 1) = 2 • COS?(P t+1 ) (9.3) 

and we get that 

m(U t+ i) = maxj{m j (Ut + i)} = max{maXj >0 {m J '([/ t )},m ([/ w )} 

< max{m(U t ), 2 • COSt(F t+ i)} = max{pot(P t ), 2 • COSt(F t+ i)} (9.4) 

which is exactly what we need. 

We see that the inequality ( 19. II) holds in all cases in our case analysis, which proves the lemma. □ 

The lower bound on black-white pebbling price now follows by showing that the final pebble 
configuration ({z}, 0) has high potential. 

Lemma 9.13. For z the sink of a pyramid Yi^ of height h, the pebble configuration ({z},0) has 
potential pot nh ({,z}, 0) = h + 2. 

Proof. This follows easily from the Limited hiding-cardinality property (which says that U can be 
chosen so that either U C {z} or \U\ < 0), but let us show that this assumption is not necessary 
here. The set U = {z} hides itself and has measure m(U) = m h (U) = h + 2-1 = h + 2. Suppose 
that z is hidden by some U' ^ {z}. Without loss of generality U' is minimal, i.e., no strict subset 
of U' hides z. Let u be a vertex in U' on minimal level minlevel(J7) = L < h. The fact that TJ' is 
minimal implies that there is a path P : u ~» z such that (P \ {u}) D U' = (otherwise V \ {u} 
would hide z). By Observation 19.51 there must exist h — L converging paths from sources to z that 
are all blocked by distinct pebbles in U' \ {u}. It follows that 

m(U') > m L (U') = L + 2\U'{hL}\ = L + 2\U'\ > L + 2 ■ (h + 1 - L) > h + 2 (9.5) 

(where we used that U'{^L} = U' since L = minlevel(J7)). Thus U = {z} is the unique 
minimum-measure hiding set for ({z}, 0), and the potential is pot({2;}, 0) = h + 2. □ 

Since [37 ] proves that pyramids possess the Limited hiding-cardinality property, and since there 
are pebblings that yield matching upper bounds, we have the following theorem. 
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Theorem 9.14 BW-Peb(U h ) = § + 0(1). 

Proof. The upper bound was shown in Lemma l9~2l For the lower bound, Lemma l9.13l says that the 
final pebble configuration ({z}, 0) in any complete pebbling V of TL^ has potential pot({z}, 0) = 
h + 2. According to Lemma|9ll2l pot({z}, 0) < 2 • COSt(V). Thus eW-Petyll/J > h/2 + 1. □ 

In the final two subsections of this section, we provide a fairly detailed overview of the proof 
that pyramids do indeed possess the Limited hiding-cardinality property. As was discussed above, 
the reason for giving all the details is that we will need to use and modify the construction in non- 
trivial ways in the next section, where we will use ideas inspired by Klawe's paper to prove lower 
bounds on the pebbling price of pyramids in the blob-pebble game. 

9.3 Proving the Limited Hiding-Cardinality Property 

We present the proof of that pyramids have the Limited hiding-cardinality property in a top-down 
fashion as follows. 

1. First, we study what hiding sets look like in order to better understand their structure. Along 
the way, we make a few definitions and prove some lemmas culminating in Definition 19.201 
and Lemma 19.241 

2. We conclude that it seems like a good idea to try to split our hiding set into disjoint com- 
ponents, prove the LHC property locally, and then add everything together to get a proof 
that works globally. We make an attempt to do this in Theorem 19.251 but note that the argu- 
ment does not quite work. However, if we assume a slightly stronger property locally for our 
disjoint components (Property 19.271 ), the proof goes through. 

3. We then prove this stronger local property by assuming that pyramid graphs have a certain 
spreading property (Definition 19.341 and Theorem 19.351 ). and by showing in Lemmas 19.331 
and |9.36| that the stronger local property holds for such spreading graphs. 

4. Finally, in Section 19741 we give a simplified proof of the theorem in [37] that pyramids are 
indeed spreading. 

From this, the desired conclusion follows. 

For a start, we need two definitions. The intuition for the first one is that the vertex set U is 
tight if is does not contain any "unnecessary" vertex u hidden by the other vertices in U. 

Definition 9.15 (Tight vertex set). The vertex set U is tight if for all u G U it holds that u ^ 

PA MH. 

If x is a vertex hidden by U, we can identify a subset of U that is necessary for hiding x. 

Definition 9.16 (Necessary hiding subset). If x G [[J7|] , we define U |ja;J] to be the subset of U 
such that for each u G U\[x]\ there is a source path P ending in x for which P n U = {u}. 

We observe that if U is tight and u G U, then U\lu]\ = {u}. This is not the case for non-tight 
sets. If we let U = {u} U pred(u) for some non-source u, Definition 19 . 1 6 1 yields that U\lu]\ = 0. 
The vertices in ?7|lx]j must be contained in every subset of U that hides x, since for each v G [/[[xJJ 
there is a source path to x that intersects U only in v. But if U is tight, the set U H^J) is also sufficient 
to hide x, i.e., x G p/|lxl|]|. 

Lemma 9.17 (Lemma 3.1 in [37 1). If U is tight and x G \\U~^, then C/|[xJ] hides x and this set is 
also contained in every subset of U that hides x. 
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Proof. The necessity was argued above, so the interesting part is that x E p/Uadfl]- Suppose not. 
Let Pi be a source path to x such that Pi n U][x]\ = 0. Since U hides x, U blocks P\. Let v be 
the highest-level element in Pi n U (i.e., , the vertex on this path closest to x). Since U is tight, 
U \ {v} does not hide v. Let P2 be a source path to v such that P2 H (U \ {v}) = 0. Then going 
first along P2 and switching to Piin v we get a path to x that intersects U only in v. But if so, we 
have v E E/|la;J| contrary to assumption. Thus, x E f[J7 LL^JJTI must hold. □ 

Given a vertex set U, the tight subset of U hiding the same elements is uniquely determined. 

Lemma 9.18. For any vertex set U in a layered graph G there is a uniquely determined minimal 
subset U* C U such that \\U*~\\ = \\U~\\, U* is tight, and for any U' C U with [ft/ 7 ]] = \\U~\\ it 
holds that U* C [/'. 

Proof. We construct the set {/* bottom-up, layer by layer. We will let U* be the set of vertices on 
level i or lower in the tight hiding set under construction, and VJ be the set of vertices in U strictly 
above level i remaining to be hidden. 

Let L = mmlevel(P). For i < L, we define U* = 0. Clearly, all vertices on level L in P must 
be present also in U*, since no vertices in U{>- L} can hide these vertices and vertices on the same 
level cannot help hiding each other. Set {/£ = P{~ L} = U \ U{>- L}. Now we can remove from 
U all vertices hidden by [/£, so set U r L = U \ [fi/£"J]. Note that there are no vertices on or below 
level L left in U r L , i.e., P£ = U^{>- L}, and that ?7£ hides the same vertices as does U{-< L} (since 
the two sets are equal). 

Inductively, suppose we have constructed the vertex sets U*_ 1 and Just as above, set 

U* = [/"*_! U U%_ x {~i} and U[ = UJ^ \ ff!7*]|. If there are no vertices remaining on level i 
to be hidden, i.e., if Z7J_ 1 { ~ ^ } = 0, nothing happens and we get U* = U*_ 1 and U[ = U[_ 1 . 
Otherwise the vertices on level i in are added to U* and all of these vertices, as well as any 
vertices above in U[_ 1 now being hidden, are removed from U[_ 1 resulting in a smaller set U[. 

To conclude, we set U* = for M = maxlevel(?7). By construction, the invariant 

wn = m±m (9-6) 

holds for all levels i Thus, [fZ7*fl = f[C/]j. Also, U* must be tight since if v E U* and level(w) = i, 
by construction U*{^i} does not hide v, and (as was argued above) neither does U*{^i} \ {v}. 
Finally, suppose that U' C U is a hiding set for U with U* ^ V. Consider v E U*\U' and suppose 
level(u) = i. On the one hand, we have v (£ [[f7*_ 1 ]] by construction. On the other hand, by 
assumption it holds that v E f[f/'{-< and thus v E f[f{-< i}H. But then by the invariant (19.6b we 
know that u E [fC/ l *_ 1 ]], which yields a contradiction. Hence, U* C [/' and the lemma follows. □ 

We remark that U* can in fact be seen to contain exactly those elements u E U such that u is 
not hidden by U \ {u}. 

It follows from Lemma [9. 181 that if U is a minimum-measure hiding set for P = (B, W), we 
can assume without loss of generality that U U W is tight. More formally, if [7 U W is not tight, 
we can consider minimal subsets U' Q U and W' C such that £7' U W' hides B and is tight, 
and prove the LHC property for B and W' with respect to this U' instead. Then clearly the LHC 
property holds also for B and W. 

Suppose that we have a set U that together with W hides B. Suppose furthermore that B 
contains vertices very far apart in the graph. Then it might very well be the case that U U W can 
be split into a number of disjoint subsets Ui U W% responsible for hiding different parts Bi of P, 
but which are wholly independent of one another. Let us give an example of this. 

Example 9.19. Suppose we have the pebble configuration (P, W) = ({xi, J/i, v$}, {w^, sq, S7}) 
and the hiding set U = {v±,U2, 1*3, U3, S5} in Figure [T0( a) | Then U U W hides P, but U seems un- 
necessarily large. To get a better hiding set U* , we can leave 55 responsible for hiding v$ but replace 
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(a) Hiding set U with large size and measure. (b) Smaller hiding set U* with smaller measure. 

Figure 10: Illustration of hiding sets in Example [9T9] (with vertices in hiding sets cross-marked). 



{vi,U2, U3, vs} by {xi, yi}. The resulting set U* = {xi, yi, S5} in Figure [K)(b)| has both smaller 
size and smaller measure (we leave the straightforward verification of this fact to the reader). 

Intuitively, it seems that the configuration can be split in two components, namely (B\, W\) = 
({x 1 ,y 1 },{w 3 }) with hiding set Ui = {v 1 ,u 2 ,u 3 ,v 3 } and (B 2 ,W 2 ) = ({v 5 }, {s 6 , s 7 }) with 
hiding set U 2 = {s$}, and that these two components are independent of one another. To improve 
the hiding set U, we need to do something locally about the bad hiding set U\ in the first component, 
namely replace it with U* = {x\,y\}, but we should keep the locally optimal hiding set U 2 in the 
second component. 

We want to formalize this understanding of how vertices in B, W and U depend on one another 
in a hiding set U U W for B. The following definition constructs a graph that describes the structure 
of the hiding sets that we are studying in terms of these dependencies. 

Definition 9.20 (Hiding set graph). For a tight (and non-empty) set of vertices X in G, the hiding 
set graph TL = TL(G, X) is an undirected graph defined as follows: 

• The set of vertices of H is V(H) = [[X]]. 

• The set of edges E(TL) of TL consists of all pairs of vertices (x, y) for x, y € [pTf] such that 

g% n rrxu*jfli n G y A n PM^. 

We say that the vertex set X is hiding-connected if H(G, X) is a connected graph. 

When the graph G and vertex set X are clear from context, we will sometimes write only TL{X) 
or even just TL. To illustrate Definition I9.20L we give an example. 

Example 9.21. Consider again the pebble configuration (B, W) = {{x\, y%, v$}, {11)3, sq, S7}) from 
Example |9 . 1 91 with hiding set U = {vi , u 2 , U3, v 3, S5}, where we have shaded the set of hidden ver- 
tices in Figure [TT(a)| The hiding set graph TL(X) for X = U U W = {v\,u 2 ,U3, v$, W3, S5, sq, 57} 
has been drawn in Figure [Tl(b)| In accordance with the intuition sketched in Example l9.19[ TL(X) 
consists of two connected components. 

Note that there are edges from the top vertex y\ in the first component to every other vertex 
in this component and from the top vertex v 5 to every other vertex in the second component. We 
will prove presently that this is always the case (Lemma |9.22| ). Perhaps a more interesting edge 
in TC(X) is, for instance, (wi,x 2 ). This edge exists since X^ijj = {vi,u 2 ,us} and X^JJ = 
{u 2 , U3, V3, W3} intersect and since as a consequence of this (which is easily verified) we have 
fl^ 1 n pfUiuiJ]]] PI n^ 2 n pfUsajfll / 0- For the same reason, there is an edge (n 5 ,n 6 ) since 



X\[u 5 \l = {s 5 , s 6 } and X|i« 6 JJ = {s 6 , s 7 } intersect. 
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(a) Vertices hidden by U u W. (b) Hiding set graph H(U u W). 



Figure 1 1 : Pebble configuration with hiding set and corresponding hiding set graph. 

Lemma 9.22. Suppose for a tight vertex set X that x £ ][X]] and y £ X[[x]J. Then x and y are in 
the same connected component ofTC(X). 

Proof. Note first that x, y £ \\X~\\ by assumption, so x and y are both vertices in TL(X). Since x 
is above y we have G% D and we get G£ n [ly] n n |Iwl = [T^WH n G\ n 
{y} = {y} 7^ 0- Thus, (x, y) is an edge in TL{X), so x and y are certainly in the same connected 
component. □ 

Corollary 9.23. If X is tight and x £ [PHI x ant/ a// ofXHrrJJ are in ?/je same connected 
component ofTt(X). 

The next lemma says that if TL{X) is a hiding set graph with vertex set V = [f-Xf], then 
the connected components V\, . . . , Vk of TC(X) are themselves hiding set graphs defined over the 
hiding-connected subsets X n Vi, . . . , X n 14- 

Lemma 9.24 (Lemma 3.3 in |37 |). Let X be a tight set and let Vi be one of the connected com- 
ponents in 7i{X). Then the subgraph ofTC(X) induced by Vi is identical to the hiding set graph 
7i(X fl Vi) defined on the vertex subset X n Vi. In particular, it holds that Vi = \\X n ViJ|. 

Proof. We need to show that Vi = \\X n V^f] and that the edges of TL{X) in are exactly the 
edges in H(X n Vi). Let us first show that y £ if and only if y £ pT n Vi]]. 

(=^) Suppose y £ Vi. Then Xy C Vi by Corollary 19.231 Also, C X by definition, so 

XM Q X n Vi. Since y £ by Lemma |9~T71 clearly y £ ][X n Vi]]. 

(-4=) Suppose y £ f[JC n Vi]]. Since X is tight, its subset I fl 7; must be tight as well. 
Applying Lemma |9~T71 twice, we deduce that (X n Vi)W hides y and that ly C (X n Vi)|[?/JJ 
since X^JJ is contained in any subset of X that hides y. But then a third appeal to Lemma [9. 171 
yields that (X n Vi)M C Iw since Ijjj C (X n Vi)M C X n Vi and consequently 

XM = (X n Vi)W • (9-7) 
By Corollary y and all of (X n Vi) M = %M 

are in the same connected component. Since 

XHj/J] C Vi it follows that y £ V{. 

This shows that Vi = f[X n Vi]]. Plugging (19.71 ) into Definition 19.201 we see that (x, y) is an 
edge in Ti{X) for x, y £ Vj if and only if (x, y) is an edge in H(X n Vi). □ 

Now we are in a position to describe the structure of the proof that pyramid graphs have the 
LHC property. 
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Theorem 9.25 (Analogue of Theorem 3.7 in Il37l0 . Let P = (B, W) be any black-white pebble 
configuration on a pyramid II. Then there is a vertex set U such that U U W hides B, potn (P) = 
m(U) and either U = Bor\U\ < \B\ + \W\. 

The idea is to construct the graph TL = TL(H, U U W), study the different connected compo- 
nents in TL, find good hiding sets locally that satisfy the LHC property (which we prove is true for 
each local hiding-connected subset of U U W), and then add all of these partial hiding sets together 
to get a globally good hiding set. 

Unfortunately, this does not quite work. Let us nevertheless attempt to do the proof, note where 
and why it fails, and then see how Klawe fixes the broken details. 

Tentative proof of Theorem \9. 251 Let U be a set of vertices in II such that U U W hides B and 
pot(P) = m(U). Suppose that U has minimal size among all such sets, and furthermore that 
among all such minimum-measure and minimum-size sets U has the largest intersection with B. 

Assume without loss of generality (Lemma 19.181) that U U W is tight, so that we can con- 
struct TL. Let the connected components of TL be V\, ■ ■ ■ , For all i = 1, . . . , k, let Bi = B Pi V%, 
Wi = W n Vi, and Ui = U D Vi. Lemma l9T24l says that Ui U W{ hides B { . In addition, all Vi are 
pairwise disjoint, so \B\ = Ei=i\Bi\, \W\ = Etil^l and \U\ = Etil^l- 

Thus, if the LHC property [9.111 does not hold for U globally, there is some hiding-connected 
subset Ui U W{ that hides Bi but for which |Z7j| > \Bi \ + \ W%\ and Ui ^ Bi. Note that this implies 
that Bi <£. Ui since otherwise Ui would not be minimal. 

Suppose that we would know that the LHC property is true for each connected component. 
Then we could find a vertex set U* with U* C Bi or | U* \ < \Bi\ + |Wj| such that U* U Wi hides 
Bi and m(U*) < m(Ui). Setting U* = (U \ U) U U* , we would get a hiding set with either 
\U*\ < \U\ or \U* n B\ > \U n B\. The second inequality would hold since if \U*\ = \U\, then 
\U*\ = \U\ > \Bi U Wi\ and this would imply U* = B { and thus \U* n B { \ > \U n Bi\. This 
would contradict how U was chosen above, and we would be home. 

Almost. We would also need that U* could be substituted for Ui in U without increasing the 
measure, i.e., that m(U*) < m(Ui) should imply m((U\ U) U U*) < m((U\ U) U U). 
And this turns out not to be true. □ 

The reason that the proof above does not quite work is that the measure in Definition 19.81 is 
ill-behaved with respect to unions. Klawe provides the following example of what can happen. 

Example 9.26. With vertex labels as in Figures 171 and l9TfTTl let X\ = {s%,S2}, X2 = {w\} 
and X3 = {S3}. Then m(Xi) = 4 and m{X2) = 5 but taking unions with X3 we get that 

m(X 1 U X 3 ) =6 and m(X 2 U X 3 ) = 5. Thus m(X 1 ) < m(X 2 ) but m(X 1 U X 3 ) > 
m(X 2 U X 3 ). 

So it is not enough to show the LHC property locally for each connected component in the 
graph. We also need that sets U from different components can be combined into a global hiding 
set while maintaining measure inequalities. This leads to the following strengthened condition for 
connected components of TL. 

Property 9.27 (Local limited hiding-cardinality property). We say that the pebble configuration 
P = (B, W) has the Local limited hiding-cardinality property, or just the Local LHC property for 
short, if for any vertex set U such that U U W hides B and is hiding-connected, we can find a 
vertex set U* such that 

1. U* is a hiding set for (B,W), 

2. for any vertex set Y with Y n U = it holds that m(Y U U*) < m(Y U U), 

3. U* C Bor \U*\ < \B\ + \W\. 
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We say that the graph G has the Local LHC property if all black-white pebble configurations P = 
(B,W) onGdo. 

Note that if the Local LHC property holds, this in particular implies that m (U*) < m(U) (just 
choose Y = 0). Also, we immediately get that the LHC property holds globally. 

Lemma 9.28. If G has the Local limited hiding-cardinality property \9.27\ then G has the Limited 
hiding-cardinality property 19.771 

Proof. Consider the tentative proof of Theorem 19.251 and look at the point where it breaks down. 
If we instead use the Local LHC property to find U* , this time we get that m(U*) < m(Ui) does 
indeed imply m ((77 \ Ui) U U*) < m ((U \ U) U Ui) , and the theorem follows. □ 

An obvious way to get the inequality m(Y U U*) < m(Y U U) in Property 19.271 would be 
to require that m?(U*) < m^{U) for all j, but we need to be slightly more general. The next 
definition identifies a sufficient condition for sets to behave well under unions with respect to the 
measure in Definition 19. 8i 

Definition 9.29. We write U ^ m V if for all j > there is an i < j such that w?{U) < m l (V). 

Note that it is sufficient to verify the condition in Definition 19. 29 1 for j = 1, . . . , maxlevel({7). 
For j > maxlevel(?7) we get m?(U) = and the inequality trivially holds. 

It is immediate that U ^ m V implies m{U) < m(V), but the relation ^ m gives us more 
information than that. Usual inequality m(U) < m{V) holds if and only if for every j we can 
find an i such that m J ({7) < m l (V), but in the definition of ^ m we are restricted to finding such 
an index i that is less than or equal to j. So not only is m(U) < m(V) globally, but we can also 
explain locally at each level, by "looking downwards", why U has smaller measure than V. 

In Example 19.261 X\ X2 since the relative cheapness of X\ compared to X2 is explained 
not by a lot of vertices in X2 on low levels, but by one single high-level, and therefore expensive, 
vertex in X2 which is far above X\ . This is why these sets behave badly under union. If we have 
two sets X\ and X2 with X\ X2, however, reversals of measure inequalities when taking 
unions as in Example l9.26l can no longer occur. 

Lemma 9.30 (Lemma 3.4 in (37)). IfU < m V and Y n V = 0, then m(Y U U) < m{Y U V). 

Proof. To show that m(Y U U) < m(Y U V), for each level j = 1, . . . , maxlevel(y U U) we 
want to find a level i such that m?{Y U U) < m l (Y U V). We pick the i < j provided by 
the definition of U ^ m V such that m^{U) < m % {V). Since V n W = and i < j implies 
Y{hj} C Y{hi}, we get 

m?{Y U U) =j + 2-\(U U Y){yj}\ < j + 2 ■ \U{yj}\ + 2 • \Y{hj}\ < 

i + 2 • \V{±i}\ + 2 • \Y{hi}\ = m*(r U V) (9.8) 
and the lemma follows. □ 

So when locally improving a blocking set U that does not satisfy the LHC property to some set 
U* that does, if we can take care that U* ^ rn U in the sense of Definition 19.291 we get the Local 
LHC property. All that remains is to show that this can indeed be done. 

When "improving" U to U*, we will strive to pick hiding sets of minimal size. The next 
definition makes this precise. 

Definition 9.31. For any set of vertices X, let 

L h] (X) = mm{\Y\ : X{hj} C \\YJ and Y{hj] = Y] 

denote the size of a smallest set Y such that all vertices in Y are on level j or higher and Y hides 
all vertices in X on level j or higher. 
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Note that we only require of Y to hide X{^j} and not all of X. Given the condition that 
Y = Y{>: j}, this set cannot hide any vertices in X{-< j}. We make a few easy observations. 

Observation 9.32. Suppose that X is a set of vertices in a layered graph G. Then: 

1. L^o(X) is the minimal size of any hiding set for X. 

2. IfXC Y, then L tj (X) < L hj (Y) for all j. 

3. It always holds that Lyj(X) < \X{yj}\ < \X\. 

Proof Part □ follows from the fact that F-j>0} = V for any set V. If X C Y, then X{±j} C 
Y{tzj} and any hiding set for X{>z j} works also for Y{^j}, which yields part[2] Part[3]holds 
since X{>z j} C X is always a possible hiding set for itself. □ 

For any vertex set V in any layered graph G, we can always find a set hiding V that has 
"minimal cardinality at each level" in the sense of Definition 19.3 11 

Lemma 9.33 (Lemma 3.5 in |37 |). For any vertex set V we can find a hiding set V* such that 
\V*{hj}\ < L hj (V)forall j, and either V* = V or \V*\ < \V\. 

Proof If < Lyj(V) for all j, we can choose V* = V. Suppose this is not the case, and 

let k be minimal such that \ V{>: k}\ > Lyf.(V). Let V be a minimum-size hiding set for V{>: k} 
with V = V'{hk} and |V'| = \L hk (V)~\ and set V* = V{-<k] U V. Since V{-<k] hides itself 
(any set does), we have that V* hides V = V{^k} lilV{hk} and that 

\V*\ = \V{-<k}\ + \V'\ < \V{-<k}\ + \V{hk}\ = \V\ . (9.9) 

Combining (19.91) with part [U of Observation 19.321 we see that the minimal index found above must 
be k = 0. Going through the same argument as above again, we see that | j}| < Lyj(V) for 
all j, since otherwise ( 19.91 ) would yield a contradiction to the fact that V = F'-JVO} was chosen 
as a minimum-size hiding set for V. □ 

We noted above that Lyo(X) is the cardinality of a minimum-size hiding set of X. For j > 0, 
the quantity Lyj(X) is large if one needs many vertices on level > j to hide X{>zj}, i.e., if 
X{^j} is "spread out" in some sense. Let us consider a pyramid graph and suppose that X is a 
tight and hiding-connected set in which the level-difference maxlevel(X) — minlevel(X) is large. 
Then it seems that \X\ should also have to be large, since the pyramid "fans out" so quickly. This 
intuition might be helpful when looking at the next, crucial definition of Klawe. 

Definition 9.34 (Spreading graph). We say that the layered DAG G is a spreading graph if for 
every (non-empty) hiding-connected set X in G and every level j = 1, . . . , maxlevel(f[X|), the 
spreading inequality 

\X\ > L^(pTfl) + j - minlevel(X) (9.10) 

holds. 

Let us try to give some more intuition for Definition 19 . 34 1 by considering two extreme cases in 
a pyramid graph: 

• For j < minlevel(X), we have that the term j — minlevel(X) is non-positive, X{>: j} = X, 
and [pf{> j}| = [pfj]. In this case, ( 19.101 ) is just the trivial fact that no set that hides [jX[] 
need be larger than X itself. 

• Consider j = maxlevel([pT]]), and suppose that [fX{^j}"[] is a single vertex v with X\[x\\ = 
X. Then (19.101 ) requires that \X\ > 1 + level(x) — minlevel(X), and this can be proven to 
hold by the "converging paths" argument of Theorem 19.3 1 and Observation |9.5i 
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Very loosely, Definition 19.341 says that if X contains vertices at low levels that help to hide other 
vertices at high levels, then X must be a large set. Just as we tried to argue above, the spreading 
inequality (19.101 ) does indeed hold for pyramids. 

Theorem 9.35 ([37 1). Pyramids are spreading graphs. 

Unfortunately, the proof of Theorem 19. 35 l in 11371 is rather involved. The analysis is divided into 
two parts, by first showing that a class of so-called nice graphs are spreading, and then demonstrat- 
ing that pyramid graphs are nice. In Section l9~4l we give a simplified, direct proof of the fact that 
pyramids are spreading that might be of independent interest. 

Accepting Theorem 19.351 on faith for now, we are ready for the decisive lemma: If our layered 
DAG is a spreading graph and if U U W is a hiding-connected set hiding B such that U is too large 
for the conditions in the Local limited hiding-cardinality property |9.27| to hold, then replacing U by 
the minimum-size hiding set in Lemma l9.33l we get a hiding set in accordance with the Local LHC 
property. 

Lemma 9.36 (Lemma 3.6 in |37|). Suppose that B,W,U are vertex sets in a layered spreading 
graph G such that U U W hides B and is tight and hiding-connected. Then there is a vertex set 
U* such that U* U W hides B, U* < m U, and either U* = B or \U*\ < \B\ + \W\. 

Postponing the proof of Lemma [9.361 for a moment, let us note that if we combine this lemma 
with Lemma 19.301 and Theorem 19.351 the Local limited hiding-cardinality property for pyramids 
follows. 

Corollary 9.37. Pyramid graphs have the Local limited hiding-cardinality property \9.27\ 

Proof of Corollary \9. 3 71 This is more or less immediate, but we write down the details for com- 
pleteness. Since pyramids are spreading by Theorem 19.35 1 Lemma |9.36| says that U* is a hiding set 
for (B, W) and that U* < m U. Lemma WM then yields that m(Y U 17*) < m(Y U U) for all Y 
with Y n U = 0. Finally, Lemma |9~36l also tells us that U* C B or \U*\ < \B\ + \ W\, and thus 
all conditions in Property [9 .27 1 are satisfied. □ 

Continuing by plugging Corollary 19.371 into Lemma 19.281 we get the global LHC property in 



Theorem 9.25 on page 55 So all that is needed to conclude Klawe's proof of the lower bound for 
the black- white pebbling price of pyramids is to prove Theorem 19.351 and Lemma |9.36l We attend 
to Lemma l9.36l right away, deferring a proof of Theorem 19. 35 1 to the next subsection. 

Proof of Lemma \9~36\ If \U\ < \B\ + \W\ we can pick U* = U and be done, so suppose that 
\U\ > \B\ + \ W\. Intuitively, this should mean that U is unnecessarily large, so it ought to be 
possible to do better. In fact, U is so large that we can just ignore W and pick a better U* that hides 
B all on its own. 

Namely, let U* be a minimum-size hiding set for B as in Lemma l9.33l Then either U* = B or 
\U*\ < \B\ < \B\ + \ W\. To prove the lemma, we also need to show that U* ^ m U, which will 
guarantee that U* behaves well under union with other sets with respect to measure. 

Before we do the the formal calculations, let us try to provide some intuition for why it should be 
the case that U* ^ m U holds, i.e., that for every j we can find an i < j such that m? (U*) < m J (J7). 
Perhaps it will be helpful at this point for the reader to look at Example 19.191 again, where the 
replacement of U± = {vx, u%, 1*3, Vs} in Figure [T0(a)1 by J7* = {xi,yx} in Figure [10(b) | shows 



Lemmas l9.33l andl 9.36l in action. 

Suppose first that j < minlevel(t7 U W) < minlevel(J7). Then the measure inequality 
m?{U*) < mi(U) is obvious, since U{>: j} = U is so large that it can easily pay for all of U* , let 
alone U*{hj] Q U*. 
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For j > minlevel(£7 U W), however, we can worry that although our hiding set U* does in- 
deed have small size, the vertices in U* might be located on high levels in the graph and be very 
expensive since they were chosen without regard to measure. Just throwing away all white pebbles 
and picking a new set U* that hides B on its own is quite a drastic move, and it is not hard to con- 
struct examples where this is very bad in terms of potential (say, exchanging 55 for v§ in the hiding 
set of Example |9.19l ). The reason that this nevertheless works is that \ U\ is so large, that, in addition, 
U U W is hiding-connected, and that, finally, the graph under consideration is spreading. Thanks 
to this, if there are a lot of expensive vertices in U*{>z j} on or above some high level j resulting in 
a large partial measure m? (U*), the number of vertices on or above level L = minlevel(£/ U W) 
in U = U{ y L} is large enough to yield at least as large a partial measure m L (ll). 

Let us do the formal proof, divided into the two cases above. 

1. j < minlevel(i7 U W): Using the lower bound on the size of U and that level j is no higher 
than the minimal level of U, we get 



m' 



{U*)=j + 2-\U*{hj}\ 
< j + 2- \U*\ 
<j + 2-\B\ 
<3 + 2-\U\ 

= j + 2- \u{yj}\ 

= m j (U) 



[ by definition of m 3 (•) ] 

[ since V{>zj} Q V for any V ] 

[ by construction of U* in Lemma [9.331 

[ by assumption \U\ > \B\ + \W\ > \B 

[ U{>: j} = U since j < minlevel(C/) ] 

[ by definition of m? (•) ] 



and we can choose i = j in Definition 19.291 

2. j > minlevel(i7 U W): Let L = minlevel(C/ U W). The black pebbles in B are hidden by 
U U W, or in formal notation B C [[[/ U WJ, so 



L tj (B) < L hj ($U U Wf) 



(9.11) 



holds by part [2] of Observation 19.321 Moreover, U U W is a hiding-connected set of vertices 
in a spreading graph G, so the spreading inequality in Definition 19.341 says that \U U W\ > 
Lyj^U U Wl) +j-L,ot 



j + L hj (\\U U Wf) <L + \U U W\ 
after reordering. Combining (19.111 ) and (19.121) we have that 

j + L tj (B) < L + \U U W\ 

and it follows that 



(9.12) 



(9.13) 



m j (U*)=j + 2-\U*{hj}\ 

<j + \U*{hj}\ + \U*\ 
<j + L hj (B) + \B\ 
< L + \U U W\ + \B\ 
<L + 2-\U\ 
= L + 2-\U{hL}\ 
= m L {U) 



by definition of m 3 {-) ] 

since V{^j} C V for any V ] 

by construction of U* in Lemma 19.33 1 

by the inequality (19.131 ) ] 

by assumption \U\ > \B\ + \W\ ] 

U{^L} = U since L < minlevel(?7) 

by definition of m L (-) 1 



Thus, the partial measure of U at the minimum level L is always larger than the partial mea- 
sure of U* at levels j above this minimum level, and we can choose i = L in Definition |9.29i 
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Consequently, U* ^ m U, and the lemma follows. □ 

Concluding this subsection, we want to make a comment about Lemmas |9.33l and 19.361 and try 
to rephrase what they say about hiding sets. Given a tight set U U W such that B C [[[/ u Wj| , 
we can always pick a U* as in Lemma 19331 with U* = B or \U* | < \B\ and with |?7*{^i}| < 
Lyj(B) for all j. This will sometimes be a good idea, and sometimes not. Just as in Lemma |9.36[ 
for j > minlevel({7 U W) we can always prove that 

m j (U*) < minlevel(C/ U W) + \U\ + (\B\ + \W\) . (9.14) 

The key message of Lemma 19.361 is that replacing U by U* is a good idea if U is sufficiently 
large, namely if \U\ > \B\ + \W\, in which case we are guaranteed to get m^(U*) < m L (U) for 
L = minlevel(£7 U W). 

9.4 Pyramids Are Spreading Graphs 

The fact that pyramids are spreading graphs, that is, that they satisfy the inequality ( 19.101) , is a 
consequence of the following lemma. 

Lemma 9.38 (Ice-Cream Cone Lemma). If X is a tight vertex set in a pyramid II such that 
TC(X) is a connected graph with vertex set V = [fX|], then there is a unique vertex x 6 V such 
that X = IH and V = [pf UiJ]]] Q n^. 

What the lemma says it that for any tight vertex set X, the connected components V\, . . . , Vk 
look like ragged ice-cream cones turned upside down. Moreover, for each "ice-cream cone" Vi, all 
vertices in X n Vi are needed to hide the top vertex. The two connected components in Figure QT] 
are both examples of such "ice-cream cones." 

Before proving Lemma 19.381 we show how this lemma can be used to establish that pyramid 
graphs are spreading by a converging-paths argument as in Observation 19.51 

Proof of Theorem \9. 35\ Suppose that X is a tight and hiding-connected set, i.e., such that Tt(X) is 
a single connected component with set of vertices V = [[X]] . Let x € V be the vertex given by 
Lemma 19381 such that X = X\[x\\ and V = pTlJzJ]]] C IT£, and let M = level(x). 
For any j < M we have 

L hj (UJ) < M-j + 1 . (9.15) 

This is so since there are only so many vertices on level j in IIJ and the set of all these vertices 
must hide everything in [[X]] above level j since [[X]] C II x h . 

By assumption X is tight and all of X is needed to hide x, i.e., X = X\[x]\. Pick a vertex 
v E X on bottom level L = minlevel(X). Since v £ X|]xj] there is a path P : v ~^ x such 
that P n X = {v}. Consider the set of converging source paths for P in Observation 19.51 All 
these converging paths Pi, P2, . . . , Pm-l must be blocked by distinct vertices in X \ {v}, since 
P i H Pj C P \ {v} and P \ {v} does not intersect X. From this the inequality 



|X|>Af-L+l (9.16) 
follows. By combining (19.151 ) and (19.161 ), we get that 

\X\ - L^-flprH) > M - L + 1 - (M - j + 1) = j - L (9.17) 

which is the required spreading inequality (19- 1 Ob - □ 



The rest of this subsection is devoted to proving the Ice-Cream Cone Lemma. We will use that 
fact that pyramids are planar graphs where we can talk about left and right. More precisely, the 
following (immediate) observation will be central in our proof. 
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Observation 9.39. Suppose for a planar DAG G that we have a source path P to a vertex w and 
two vertices u, v S on opposite sides of P. Then any path Q : u v must intersect P. 

Given a vertex v in a pyramid II, there is a unique path that passes through v and in every vertex 
u moves to the right-hand successor of u. We will refer to this path as the north-east path through v, 
or just the NE-path through v for short, and denote it by Pne(v). The path through v always moving 
to the left is the north-west path or NW-path through v, and is denoted P^{v). For instance, for 
the vertex in our running example pyramid in Figure|7]we have Pne(^4) = {-S4, ^4, va, ^4} and 
Pnw(^4) = {^6, U5, V4, W3, X2,yi}. To simplify the proofs in what follows, we make a couple of 
observations. 

Observation 9.40. Suppose that X is a tight set of vertices in a pyramid U and that v E [PH]. 
Then pT|L„j|H C 11%. 

Proof. Since all vertices in X|JuJ] have a path to v by definition, it holds that C 11^. Any 

vertex u € II \ II^ must lie either to the left of Pne(^) or to the right of f^(«) (or both). In the 
first case, Pm(u) is a path via u that does not intersect X[]vJ], so u ^ [f-^WH- m the second case, 
we can draw the same conclusion by looking at Pwn(u). Thus, (II \ IT^j n [pT||u]]| = 0. □ 

Observation 9.41. Suppose that X is a tight set of vertices in a DAG G and that v G ^\X~\\. Then 
there is a source path P to v such that \P f] X \ = 1. 

Proof. Let Pi be any source path to v and note that Pi intersects X since v £ \\X~\\ . Let y be the 
last vertex on Pi in Pi n X, i.e., the vertex on the highest level in this intersection. Since X is 
tight, there is a source path P2 to y that does not intersect X \ {y}. Let P be the path that starts like 
P2 and then switches to Pi in y. Then |P fl X\ = \{y}\ = 1. □ 

Using Observations 19 .401 and 19.411 we can simplify the definition of the hiding set graph. Note 
that Observation [9T40] is not true for arbitrary layered DAGs, however, or even for arbitrary layered 
planar DAGs, so the simplification below does not work in general. 

Proposition 9.42. Let TC = TC(Ii,X) be the hiding set graph for a tight set of vertices X in a 
pyramid II, and suppose that u,v £ \\X~^. Then the following conditions are equivalent: 

1. (u, v) is an edge in U, i.e., 11^ n pwl n 11^ n [flwl + 0- 

2. [priHiH n pwl ± 0. 

3. iw n ih / 0. 

Proof. The directions (GQ) (0) and ([3]) © are immediate. The implication (0 =>• (Q~|) also 
follows easily, since [flH| C and [fX|L«JlH C by Observation 1^401 To prove © =>- ©, 
fix some vertex w G pw| n [[X|[?;J]]1 and let P be a source path to w as in Observation 19.411 
with P n X = {y} for some vertex y. Since P n X\[u]\ / l/ Pfl I[«J by assumption, we 
have y G X||_«JJ n / 0. □ 

As the first part of the proof of Lemma 19.381 we show that all vertices hidden by a hiding- 
connected set X are contained in a subpyramid, the top vertex of which is also hidden by X. This 
gives the ice-cream cone shape alluded to by the name of the lemma. 

Lemma 9.43. Let TC = H(TL, X) be the hiding set graph of a hiding-connected vertex set X in a 
pyramid II. Then there is a unique vertex x £ f[-X"fl such that [fX]] C II^. 
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Figure 12: Illustration of proof of Lemma I03l that TL is not connected if x £ pfTJ. 



Proof. It is clear that at most one vertex x G [[X]] can have the properties stated in the lemma. 
We show that such a vertex exists. As a quick preview of the proof, we note that it is easy to find 
a unique vertex x on minimal level such that f[X|| C 11^ . The crucial part of the lemma is that x 
is hidden by X. The reason that this holds is that the graph TL is connected. If x ^ [fX|], we can 
find a source path P to the top vertex z of the pyramid such that P does not intersect X but there 
are vertices in TL both to the left and to the right of P. But there is no way we can have an edge 
crossing P in TL, so the hiding set graph cannot be connected after all. Contradiction. 

The above paragraph really is the whole proof, but let us also provide the (somewhat tedious) 
formal details for completeness. To follow the formalization of the argument, the reader might be 
helped by looking at Figure[l2] Suppose that II has height h and let s\, S2, ■ ■ ■ , s/i+i be the sources 
enumerated from left to right. Look at the north-east paths Pne(si), Pne{s2), ■ ■ ■ and let Sj be the 
first vertex such that PneCsj) H [[X]] ^ 0. Similarly, consider fW(s/H-i), Pnw(sh), ■ ■ • and let Sj 
be the first vertex such that P^wisj) H [fX]] ^ 0. It clearly holds that i < j. 

Let x be the unique vertex where Pne^) and P-Nw{sj) intersect. By construction, we have 
[jX|] C n^, since no NE-path to the left of PneCsj) = -Pne(^) intersects [pf]] and neither does 
any NW-path to the right of P^w(sj) = P^-w(x). We need to show that it also holds that x G f[-X"fl- 

To derive a contradiction, suppose instead that x ^ \\X~^. By definition, there is a path P from 
some source s* to x such that P n [[X]] = 0. P cannot coincide with Pne(^) or Pnw(^) since 
the latter two paths both intersect [[X]] by construction. Since U.7 n [[X]] = 0, we can extend 
P to a path P* : s* ~^ z via x having the property that P* n [[X]] = but there are vertices in 
TL(X) both to the left and to the right of P*, namely, the non-empty sets Pne(^) n [[X]] n 11^ and 
Pnw(x) H [[X| n 11^. We claim that this implies that TL is not connected. This is a contradiction 
to the assumptions in the statement of the lemma and it follows that x S [[X]] must hold. 

To establish the claim, note that if TL is connected, there must exist some edge (u, v) between 
a vertex u to the left of P* and a vertex v to the right of P*. Then Proposition 19.421 says that 
f[X|LuJj]l n [[XmH / 0. Pick any vertex w £ pw| n and assume without loss of 

generality that w is on the right-hand side of P*. We prove that such a vertex w cannot exist. See 
the example vertices labelled u, v and w in Figure [121 which illustrate the fact that w £ [fX|L«l]| if 
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w g pHl- 

Since w is assumed to be hidden by ([J|u||, the NW-path through w must intersect X[[uJJ 
somewhere before w or in w. Fix any y G Pnw(^) H Iw n 11™ and note that y must also 
be located to the right of P*. By Definition 19.161 there is a source path P' via y to u such that 
P' n X = {y}. But P' must intersect P* somewhere above y, since y is to the right and u is to 
the left of P*. (Here we use Observation 19.391 ) Consider the source path that starts like P* and 
then switches to P' at some intersection point in P' n P* fi n^. This path reaches u but does not 
intersect X, contradicting the assumption u G pf[|. It follows that [fl[u|| n [IhI = for all 
u and v on different sides of P* , so there are no edges across P* in TL. This proves the claim. □ 

The second part needed to prove Lemma |9.38| is that all vertices in X are required to hide the 
top vertex x G [[X]] found in Lemma l9.43l 

Lemma 9.44. Let TL = TL(H, X) be the hiding set graph of a hiding-connected vertex set X in a 
pyramid II and let x G [[X]] be the unique vertex such that [[X]] C IL^. Then X = X\[xj\. 

Proof. By definition, X\[x\\ C X. We want to show that X|[iJ] = X. Again, let us first try to 
convey some intuition why the lemma is true. If X \ X\[x]\ ^ 0, since X is hiding-connected there 
must exist some vertex hidden by all of X but not by just X\[x]\ or X \ X|[xJJ (otherwise there can 
be no edge between the components of TL containing X\[x]\ and X \ X|[^JJ, respectively). But if so, 
it can be shown that the extra vertices in X \ X\[x]\ help X^Jj to hide one of its own vertices. This 
contradicts the fact that X is tight, so we must have X\\_x]\ = X which proves the lemma. 

Let us fill in the formal details in this proof sketch. Assume, to derive a contradiction, that 
XlxH / X. Since X is tight, it holds that {X \ X|]a;J]) n ^Iw| = 0, so H contains vertices 
outside of |]X|]a:jf[|. Since TL is connected, there must exist some edge (u,u') between a pair 
of vertices u G [[X]] \ [fXUzJ]]] and u' G [[X||_a;Jj|. Lemma I9T71 says that X[jyj] C X||_a:J and 
Proposition 19.421 then tells us that X||uJ] n X|[xJ] / 0. Also, Xh \ X|[xJJ / since u X|jxJJ. 
For the rest of this proof, fix some arbitrary vertices r G X|L«J] H X||zJ] and s G X|[uJ \X\[x]\. We 
refer to Figure [l3]for an illustration of the proof from here onwards. 
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By Definition ^. 161 there are source paths P r via r to u and P s via s to u that intersect X only 
in r and s, respectively. Also, there is a source path P to x such that P Pi X = {r} since r G XUxJ]. 
Suppose without loss of generality that s is to the right of P. The paths P s and P cannot intersect 
between s and u. To see this, observe that if P s crosses P after s but before r, then by starting with 
P and switching to P s at the intersection point we get a source path to u that is not blocked by X. 
And if the crossing is after r, we can start with P s and then switch to P when the paths intersect, 
which implies that s G X^x\\ contrary to assumption. Thus u is located to the right of P as well. 

Extend P s by going north-west from u until hitting P, which must happen somewhere in be- 
tween r and x, and then following P to x. Denote this extended path by Pf and let w be the vertex 
starting from which Pf and P coincide. The path P® must intersect X in some more vertex after 
s since s ^ X[[a:JJ. Pick any v G Pf fl (X \ {s}). By construction, v must be located strictly 
between u and w. We claim that X \ {v} hides v. This contradicts the tightness of X and the 
lemma follows. 

To prove the claim, consider any source path P v to v and assume that P v fl (X \ {v}) = 0. 
Then, in particular, r ^ P v . Suppose that P v passes to the left of r. By planarity, P v must intersect 
P somewhere above r. But if so, we can construct a source path P' to x that starts like P v and 
switches to P at this intersection point. We get P' n X = 0, which contradicts x G X|[a:JJ. If 
instead P v passes r on the right, then P v must cross P r in order to get to v. This implies that there 
is a source path P" to u such that P" n X = 0, namely the path obtained by starting to go along 
P v and then changing to P r when the two paths intersect above r. Thus we get a contradiction in 
this case as well. Hence, X \ {v} blocks any source path to v as claimed. □ 

The Ice-Cream Cone Lemma l9.38l now follows. Thereby, the proof of the lower bound on the 



black-white pebbling price of pyramid graphs in Theorem 9. 14 on page 5 1 is complete. 



10 A Tight Bound for Blob-Pebbling the Pyramid 

Inspired by Klawe's ideas in Section |9l we want to do something similar for the blob-pebble game 



in Definition 6.8 on page 28 1 In this section, we study blob-pebblable DAGs (Definition I6.6I ) that 



are also layered. We show that for all such DAGs Gh of height h that are spreading in the sense 
of Definition I9.341 it holds that Blob-Peb(Gh) = Q(h). In particular, this bound holds for pyra- 
mids Hh since they are spreading by Theorem I9.35 1 

The constant factor that we get in our lower bound is moderately small and explicit. In fact, we 
believe that it should hold that Blob-Peb(G h ) > h/2 + 0(1) for layered spreading graphs G h of 
height h, just as in the standard black-white pebble game. As we have not made any real attempt 
to get optimal constants, the factor in our lower bound can be improved with a minor effort, but 
additional ideas seems to be needed to push the constant all the way up to ^. 

10.1 Definitions and Notation for the Blob-Pebbling Price Lower Bound 

Recall that a vertex set U hides a black pebble on b if it blocks all source paths visiting v. For a 
blob B, which is a chain by Definition I6.7I it appears natural to extend this definition by requiring 
that U should block all paths going through all of B. We recall the terminology and notation from 
Definition I6. 51 that a black blob B and a path P agree with each other, or that P is a path via B, if 
PCP, and that *p v ; a (P) denotes the set of all source paths agreeing with B. 

Definition 10.1 (Blocked black blob). A vertex set U blocks a blob B if U blocks all P G *£ v ia(P)- 



A terminological aside: Recalling the discussion in the beginning of Section 19.21 it seems 
natural to say that U blocks a black blob B rather than hides it, since standing at the sources we 
might "see" the beginning of B, but if we try to walk any path via B we will fail before reaching 
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the top of B since U blocks the path. This distinction between hiding and blocking turns out to be 
a very important one in our lower bound proof for blob-pebbling price. Of course, if B is an atomic 
black pebble, i.e., \B\ = 1, the hiding and blocking relations coincide. 

Let us next define what it means to block a blob-pebbling configuration. 

Definition 10.2 (Unblocked paths). For [i?](W) an blob subconfiguration, the set of unblocked 
paths for [.B](W) is 

unblocked ( [B] (W)) = {P G <p via (£) | W does not block P} 

and we say that U blocks [B](W) if U blocks all paths in unblocked([.B](PV)). We say that U 
blocks the blob-pebbling configuration S if U blocks all [B] (W) G 8. If so, we say that U is a 
blocker of [B] (W) or S, respectively, or a blocking set for [B] (W) or §. 

Comparing to Section I9T21 note that when blocking a path P G $p v i a (_B), U can only use the 
white pebbles W that are associated with B in [5] (IV). Although there might be white pebbles 
from other subconfigurations [B'}(W') ^ [5](W) that would be really helpful, U cannot enlist the 
help of the white pebbles in W' when blocking B. The reason for defining the blocking relation in 
this way is that these white pebbles can suddenly disappear due to pebbling moves performed on 
such subconfigurations [B'](W). 



Reusing the definition of measure in Definition 9.8 on page 48 we generalize the concept of 
potential to blob-pebbling configurations as follows. 

Definition 10.3 (Blob-pebbling potential). The potential of an a blob-pebbling configuration § is 

pot(S) = mm{m(U) : [/blocks §} . 

If U is such that U blocks S and U has minimal measure m(U) among all blocking sets for S, we 
say that U is a minimum-measure blocking set for S. 



To compare blob-pebbling potential with the black- white pebbling potential in Definition [ 
consider the following examples with vertex labels as in Figures 171 and I914TT1 

Example 10.4. For the blob-pebbling configuration § = { [z](yi), [z](y2)}> the minimum-measure 
blocker is U = {z}. In comparison, the standard black-white pebble configuration P = (B, W) = 
({z}, {2/1,2/2}) has U = as minimum-measure hiding set. 

Example 10.5. For the blob-pebbling configuration § = { [z](0), [yi](cci, X2)}, the minimum- 
measure blocker is again U = {z}. In comparison, for the standard black-white pebble config- 
uration P = (B, W) = ({z, yi}, {xi, X2}) we have the minimum-measure hiding set U = {x^}. 

Remark 10.6. Perhaps it is also worth pointing out that Definition [KX3] is indeed a strict general- 
ization of Definition 19.91 Given a black- white pebble configuration P = (B, W) we can construct 
an equivalent blob-pebbling configuration S(P) with respect to potential by setting 

S(P) = {[b](W n G b A )\b G B] (10.1) 

but as the examples above show going in the other direction is not possible. 

Since we have accumulated a number of different minimality criteria for blocking sets, let us 
pause to clarify the terminology: 

• The vertex set U is a subset-minimal, or just minimal, blocking set for the blob-pebbling 
configuration S if no strict subset U' ^ U is a blocking set for S. 

• U is a minimum-measure blocking set for § if it has minimal measure among all blocking 
sets for S (and thus yields the potential of S). 



65 



TOWARDS AN OPTIMAL SEPARATION 



• U is a minimum-size blocking set for § if it has minimal size among all blocking sets for §. 

Note that we can assume without loss of generality that minimum-measure and minimum-size 
blockers are both subset-minimal, since throwing away superfluous vertices can only decrease the 
measure and size, respectively. However, minimum-measure blockers need not have minimal size 
and vice versa. For a simple example of this, consider (with vertex labels as in Figures 171 and I9UTTT) 
the blob-pebbling configuration § = {[.z] (103, 104)} and the two blocking sets U\ = {z} and 
U 2 = {w 1 ,w 2 }. 



10.2 A Lower Bound Assuming a Generalized LHC Property 



For the blob-pebble game, a useful generalization of Property |9.1 1 on page 48| turns out to be the 
following. 

Property 10.7 (Generalized limited hiding-cardinality property). We say that a blob-pebbling 
configuration § on a layered blob-pebblable DAG G has the Generalized limited hiding-cardinality 
property with parameter Ck if there is a vertex set U such that 

1. U blocks S, 

2. pot(S) = m(U), i.e., U is a minimum-measure blocker of S, 

3. \U\<C K - cost(S). 

For brevity, in what follows we will just refer to the Generalized LHC property. 

We say that the graph G has the Generalized LHC property with parameter Ck if all blob- 
pebbling configurations § on G have the Generalized LHC property with parameter Ck- 

When the parameter Ck is clear from context, we will just write that § or G has the Generalized 
LHC property. 

For all layered blob-pebblable DAGs Gh of height h that have the Generalized LHC property 
and are spreading, it holds that Blob-Peb(Gh) = @(h). The proof of this fact is very much in 
the spirit of the proofs of Lemma 19.121 and Theorem 19.141 although the details are slightly more 
complicated. 

Theorem 10.8 (Analogue of Theorem 19.141 1. Suppose that Gh is a layered blob-pebblable DAG 
of height h possessing the Generalized LHC property \10.7\ with some fixed parameter Ck- Then 
for any unconditional blob-pebbling V = {§0 = 0, §i> • • • > §r } of Gh it holds that 

pot (S t ) < (2C K + 1) • m&x{C0St(S s )} . (10.2) 

s<t 

In particular, for any family of layered blob-pebblable DAGs Gh that are also spreading in the 
sense of Definition ^. 34\ we have Blob-Peb(Gh) = &(h). 

We make two separate observations before presenting the proof. 

Observation 10.9. For any layered DAG G h of height h it holds that Blob-Peb(G h ) = 0(h). 



Proof. Any layered DAG Gh can be black-pebbled with h+0(l) pebbles by Theorem |9.2 on page 45 



and it is easy to see that a blob-pebbling can mimic a black pebbling in the same cost. □ 

Observation 10.10. If Gh is a layered blob-pebblable DAG of height h that is spreading in the 
sense of Definition \9.34\ then potg . ([^](0)) = h + 2. 
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Proof. The proof is fairly similar to the corresponding case for pyramids in Lemma 19.131 Note, 
though, that in contrast to Lemma 19.131 here we cannot get the statement from the Generalized 
LHC property, but instead have to prove it directly. 

Since [z] is an atomic blob, the blocking and hiding relations coincide. The set U = {z} hides 
itself and has measure h + 2. We show that any other blocking set must have strictly larger measure. 

Suppose that z is hidden by some vertex set U' / {z}. This U' is minimal without loss of 
generality. In particular, we can assume that U' is tight in the sense of Definition 19 . 1 5 1 and that U' = 
U\z§. Then by Corollary 19.231 it holds that U' is hiding-connected. Letting L = minlevel(£7') 
and setting j = h in the spreading inequality (19- 10b . we get that > 1 + h — L and hence 
m(U') > m L (U') >L + 2(l + h-L)=2h-L + 2>h + 2 since L < h. □ 

Proof of Theorem \l 0. 8\ The statement in the theorem follows from Observations 110.91 and IIP. 101 
combined with the inequality (110.21) . so just as for Theorem 19.141 the crux of the matter is the 
induction proof needed to get this inequality. 

Suppose that Ut is such that it blocks St and pot (St) = m(Ut). By the inductive hypothesis, we 
have that pot (St) < (2Ck + 1) ■ max s <t{C0S?(S s )}. We want to show for Sf+i that pot(Sf+i) < 
[2Ck + 1) • max s <t+i{cos?(S s )}. Clearly, this follows if we can prove that 

pot(St+i) < max{pot(St), [2C K + 1) • COSf(St)} . (10.3) 

We also note that if Ut blocks St+i we are done, since if so pot(St+i) < m(Ut) = pot (St). 

We make a case analysis depending on the type of move in Definition 16.81 made to get from St 
to St+i- Analogously with the proof of Lemma |9.12[ we want to show that we can use Ut to block 
Sf+i as long as the move is not an introduction on a source vertex and then use the Generalized 
LHC property to take care of such black pebble placements on sources. 

Erasure S m = S t \{[B](W)} for [B]{W) G Sf. Obviously, U t blocks S t+ i C S t . 

Inflation S t+1 = St U for [B](W) inflated from some [B'](W) G S t such that 

B' CB , (10.4a) 
W n lpp(B) C W , and (10.4b) 
B r\W = % . (10.4c) 

We claim that Ut blocks [B] (W) and thus all of St+i- Let us first argue intuitively why. 
Suppose that P is any source path agreeing with B. This path also agrees with B', and so 
must be blocked by Ut U W' by assumption. If Ut blocks B we are done. We can worry, 
though, that Ut does not block P, but that instead P was blocked by some w € W that 
disappeared as a result of the inflation move. But if w G W is on a path via B, it cannot 
have disappeared, so this can never happen. 

We now write down the formal details. With the notation in Definition I10.2[ fix any path 
P G unblocked([-B](VF)). We need to show that P fl Ut ^ 0. Let us assume without loss 
of generality that P ends in top(B), for Ut blocks [i?](W) precisely if it blocks the paths 
P n G^ p(B) for all P G unb\ocked([B](W)). We note that by definition, the fact that P 
agrees with a chain V and ends in top(V) implies that 

P C VUlpp(V) . (10.5) 

Since P agrees with B, or in formal notation P G ^ V m(B), and since B' C B by (110.4al) . 
we have P G ^ v i a (B'). By assumption, Ut blocks [!?']( W), which in particular means that 
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Ut U W intersects the path P agreeing with B'. We get 



by definition of blocking ] 
since B n W = by (fT04cl) ] 
since PCBu Jpp(B) by (fT03T> ' 
since Zpp(fi) n W C W by (flO~ 



^pn (u t u W) 
= (p n u t )u((P\B) n W) 
= (P n P t ) u (p n ipp(B) n W) 
c (p n u t ) u(Pnf) 

Pn[/ ( [PnVF = 0ifP£ unblocked([P](WO) J 

so P n Ut 7^ and the desired conclusion that C/t blocks the path P follows. 

Merger St+i = S t U {[B](W)} for [P](W) derived by merger of [Pi](Wi), [-B 2 ](W 2 ) € S t such 
that 

B 1 C\W 2 = $ , (10.6a) 
8 2 n M^i = {v*} , (10.6b) 
B = (Bx U P 2 ) \ {«*} , and (10.6c) 
W = ((Wi U W 2 ) \ {v*}) n lpp(B) . (10.6d) 

Let us again first argue informally that if a set of vertices Ut blocks two subconfigurations 
[Si](Wi) and [B2KW2}, it must also block their merger. Let P be any path via B, and 
suppose in addition that P visits the merger vertex v* . If so, P agrees with B 2 and must be 
blocked by Ut U W 2 . If on the other hand P agrees with B but does not visit u*, it is a path 
via B\ that in addition does not pass through the white pebble in W\ eliminated in the merger. 
This means that Ut U W\ \ {v*} must block P. Again, we have to argue that the blocking 
white vertices do not disappear when we apply the intersection with lpp(B) in (I10.6dl ). but 
this is straightforward to verify. 

So let us show formally that U t blocks [P](PF), i.e., that for any P £ unblocked([P](H / )) 
it holds that P n Ut / 0- As above, without loss of generality we consider only paths P 
ending in top(P) = top(Pi U B 2 ). Recall that 

B, n Wi = (10.7) 

holds for all subconfigurations by definition. We divide the analysis into two subcases. 

1- P £ *Pvia(Pi U B 2 ) = *Pvia(P U {v*}). If so, in particular it holds that P £ ^(#2) 
and since Ut blocks [B 2 ](W 2 ) we have 

jL p n (U t U W 2 ) [ by definition of blocking ] 

= (P n U t ) U ((P \ (Pi U B 2 )) n W 2 ) [ by (fT06a1> and (fToTT) ] 
= (P n U t ) u (P n Zpp(5i u P 2 ) n W 2 ) [by (fTo3T> ] 

= (P n C7t) U (P n Zpp(P U v*) n W 2 ) [just rewriting using (110.6c I ) ] 

c (P n c/ t ) u (P n (W^ 2 \ {v*}) n /pp(B) [ Zpp(P u {v*}) c / w (p) \ K} ] 

c (p n U t ) u (P n W) [by (fTOBdl) ] 

= P nU t [ since P £ unblocked([P](^)) ] 

so C/f blocks the path P in this case. 

2. P £ <Pvia(P) \ *Pvia(P U {«*}). This means that B C P but P U {v*} £ P, so the 
path P does not pass through u*. Since P agrees with B\ and C/ t blocks [Pi](Wi) by 
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assumption, we get that 

l/Pn (U t u Wi) 
= (P n u t )u((P\B) n Wt) 
= (P n u t ) u (p n (pp(B) n Wi) 
= (P n u t ) u (p n (Wi \ {«*}) n i PP (s)) 
c (p n c/" t ) u (P n w) 
= (P n c/ t ) 

and Z7t blocks the path P in this case as well. 

Introduction § t +i = §t U { [v](pred(v))}. Clearly, Ut blocks St+i if v is a non-source vertex, 
i.e., if pred(v) ^ 0, since Ut blocks §f and [v](pred(v)) blocks itself. 

Suppose however that v is a source vertex, so that the subconnguration introduced is [u](0). 
As in the proof of Lemma [9. 121 Ut does not necessarily block §t+i any longer but Ut+i = 
Ut U {v} clearly does. For j > 0, it holds that Ut+i{^j} = Ut{hj} and thus m 3 (Ut+i) = 
m j (U t ). On the bottom level j = 0, using that \U t \ < C K • COSt(S t ) Generalized LHC 
property [10.71 we have 

m (Ut +1 ) = 2-\U t+1 \ = 2-(\Ut\ + l)< 

2-(C K - cost{S t ) + 1) < 2 • (C K ■ cost(S t +i) + 1) < 
2 • (C K ■ cost(St+i) + cost(St+i)) < 2(C K + 1) • cost(St+i) (10.8) 

and we get that 

pot(S t +i) < m(U t+ i) < m.axj{m j (U t+ i)} 

< max{m(C/t), (2C K + 1) ■ COSt(8 t+1 )} = 

max{pot(S t ), (2C K + 1) • COSt(8 t+1 )} (10.9) 

which is what is needed for the induction step to go through. 

We see that regardless of the pebbling move made in the transition St ~^ §t+i, the inequality (110.31 ) 
holds. The theorem follows by the induction principle. □ 

Hence, in order to prove a lower bound on Blob-Peb(Gh) for layered spreading graphs Gh, it is 
sufficient to find some constant Ck such that these DAGs can be shown to possess the Generalized 
LHC property [l 0. 7 1 with parameter Ck- 

10.3 Some Structural Transformations 

As we tried to indicate by presenting the small toy blob-pebbling configurations in Examples 1 1 0.41 
and ll0.5l the potential in the blob-pebble game behaves somewhat differently from the potential in 
the standard pebble game. There are (at least) two important differences: 

• Firstly, for the white pebbles we have to keep track of exactly which black pebbles they can 
help to block. This can lead to slightly unexpected consequences such as the blocking set U 
and the set of white pebbles overlapping. 



[ by definition of blocking ] 
[ by (fTTIBbl) and (fTO) ] 
[ P C B U lpp(B) by (fT03T> ] 
[ since v* ^ P by assumption ] 
[by flfigffl ] 
PG unblocked([P](W))l 
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• Secondly, for black blobs there is a much wider choice where to block the blob-pebbles 
than for atomic pebbles. It seems that to minimize the potential, blocking black blobs on 
(reasonably) low levels should still be a good idea. However, we cannot a priori exclude the 
possibility that if a lot of black blobs intersect in some high-level vertex, adding this vertex 
to a blocking set U might be a better idea. 

In this subsection we address the first of these issues. The second issue, which turns out to be much 
trickier, is dealt with in the next subsection. 

One simplifying observation is that we do not have to prove Property 110.71 for arbitrary blob- 
pebbling configurations. Below, we show that one can do some technical preprocessing of the blob- 
pebbling configurations so that it suffices to prove the Generalized LHC property for the subclass 
of configurations resulting from this preprocessing!^ Throughout this subsection, we assume that 
the parameter Ck is some fixed constant. 

We start slowly by taking care of a pretty obvious redundancy. Let us say that the blob sub- 
configuration is self-blocking if W blocks B. The blob-pebbling configuration § is self- 
blocker-free if there are no self -blocking subconfigurations in S. That is, if [B] (W) is self -blocking, 
W needs no extra help blocking B. Perhaps the simplest example of this is = [v](pred(v)) 
for a non-source vertex v. The following proposition is immediate. 

Proposition 10.11. For § any blob-pebbling configuration, let §' be the blob-pebbling configura- 
tion with all self-blockers in § removed. Then COSt(S') < COSt(S), pot(S') = pot(S) and any 
blocking set U' for §>' is also a blocking set for S. 

Corollary 10.12. Suppose that the Generalized LHC property holds for self-blocker-free blob- 
pebbling configurations. Then the Generalized LHC property holds for all blob-pebbling configu- 
rations. 

Proof. If S is not self-blocker-free, take the maximal §' C § that is and the blocking set U' that the 
Generalized LHC property provides for this §'. Then U' blocks § and since the two configurations § 
and §' have the same blocking sets their potentials are equal, so pot(S) = m(U'). Finally, we have 
that \U\<C K - cost(§') < C K • COSt(S). Thus the Generalized LHC property holds for S. □ 

We now move on to a more interesting observation. Looking at § = {[z](yi), [2] (2/2)} i n 
Example 110.41 it seems that the white pebbles really do not help at all. One might ask if we could 
not just throw them away? Perhaps somewhat surprisingly, the answer is yes, and we can capture 
the intuitive concept of necessary white pebbles and formalize it as follows. 

Definition 10.13 (White sharpening). Given § = {[B i ]{W i )} i , we say that §' is a white 
sharpening of S if §' = {[B-\{W{)} ie[m] for B[ = B t and W{ C W l . 

That is, a white sharpening removes white pebbles and thus makes the blob-pebbling configu- 
ration stronger or "sharper" in the sense that the cost can only decrease and the potential can only 
increase. 

Proposition 10.14. If 8' is a white sharpening ofS it holds that C0St(S') < COSt(S) and pot(S') > 
pot(S). More precisely, any blocking set U' for S' is also a blocking set for §. 

Proof. The statement about cost is immediate from Definition 16.91 The statement about potential 
clearly follows from Definition 1 1 0. 3 1 since it holds that any blocking set U' for §' is also a blocking 
set for §. □ 

8 Note that we did something similar in Section [93] after Lemma 19.181 when we argued that if U is a minimum- 
measure hiding set for P = (B, W), we can assume without loss of generality that U U W is tight. For if not, we 
just prove the Limited hiding-cardinality property for some tight subset U' U W' C U U W instead. This is wholly 
analogous to the reasoning here, but since matters become more complex we need to be a bit more careful. 
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In the next definition, we suppose that there is some fixed but arbitrary ordering of the vertices 
in G, and that the vertices are considered in this order. 

Definition 10.15 (White elimination). For a subconfiguration and U any blocking set 

for [B](W), write W = {wi, . . . , w s }, set W° := W and iteratively perform the following for 
i = l,...,s: If UU (W l - 1 \{w i }) blocks B, set W i := W i ~ 1 \{w i }, otherwise set W i := W*' 1 . 
We define the white elimination of [B] (W) with respect to U to be W-elim([ J B] (W) , U) = [B] (W s ) 
for W s the final set resulting from the procedure above. 

For § a blob-pebbling configuration and U a blocking set for §, we define 



We say that the elimination is strict if § ^ W-elim(S, U). If § = W-elim(S, U) we say that § is 
white-eliminated, or W '-eliminated for short, with respect to U. 

Clearly W-elim(S, U) is a white sharpening of §. And if we pick the right U, we simplify the 
problem of proving the Generalized LHC property a bit more. 

Lemma 10.16. If U is a minimum-measure blocking set for §, then §' = W-elim(S, U) is a white 
sharpening ofS such that pot(S') = pot(S) and U blocks §'. 

Proof. Since §' = W-elim(S, U) is a white sharpening of § (which is easily verified from Def- 
initions [1013] and d0j3]>, it holds by Proposition 11014] that pot(S') > pot(S). Looking at the 
construction in Definition ! 10. 151 we also see that the white pebbles are "sharpened away" with care 
so that U remains a blocking set. Thus m(U) > pot(S') = pot(S) = m(U), and the lemma 
follows. □ 

Corollary 10.17. Suppose that the Generalized LHC property holds for the set of all blob-pebbling 
configurations S having the property that for all minimum-measure blocking sets U for S it holds 
that S = W-elim(S, U). Then the Generalized LHC property holds for all blob-pebbling configu- 
rations. 

Proof. This is essentially the same reasoning as in the proof of Corollary 110.121 plus induction. 
Let S be any blob-pebbling configuration. Suppose that there exists a minimum-measure blocker 
U for S such that S is not W-eliminated with respect to U. Let S 1 = W-elim(S, U). Then 
COStiS 1 ) < COSt(S) by Proposition UaU and pot(S 1 ) = pot(S) by Lemma[Jjn6] 

If there is a minimum-measure blocker U l for S 1 such that S 1 is not W-eliminated with respect 
to U 1 , set S 2 = W-elim(S 1 , U 1 ). Continuing in this manner, we get a chain S^S^S 3 , ... of 
strict W-eliminations such that COS^S 1 ) > COSt(8 2 ) > COSt(S 3 ) ... and pot(S 1 ) = pot(S 2 ) = 
pot(S 3 ) = . . . This chain must terminate at some configuration E> k since the total number of white 
pebbles (counted with repetitions) decreases in every round. 



Let U k be the blocker that the Generalized LHC property provides for S k . Then U k blocks S, 
pot(S) = pot(S fc ) = m(U k ), and \U k \ < C K ■ COSt(S k ) < C K ■ GOSf(S). Thus the Generalized 



We note that in particular, it follows from the construction in Definition 110.151 combined with 
Corollary 110.171 that we can assume without loss of generality for any blocking set U and any 
blob-pebbling configuration S that U does not intersect the set of white-pebbled vertices in S. 

Proposition 10.18. If S = W-elim(§, U), then in particular it holds that U n W(S) = 0. 



W-elim(S, U) = {W-elim([B){W),U)\[B](W) S§} . 



(10.10) 



LHC property holds for S. 



□ 



Proof. Any w G W(S) n U would have been removed in the W-elimination. 



□ 
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(a) Minimum-measure but non-tight blocking set. (b) Tight but non-connected blocker for blob. 



Figure 14: Two blob-pebbling configurations with problematic blocking sets. 
10.4 A Proof of the Generalized Limited Hiding-Cardinality Property 

We are now ready to embark on the proof of the Generalized LHC property for layered spreading 
DAGs. 

Theorem 10.19. All layered blob-pebblable DAGs that are spreading possess the Generalized 
limited hiding-cardinality property \10. 7\ with parameter Ck = 13. 

Since pyramids are spreading graphs by Theorem 19. 35 1 this is all that we need to get the lower 
bound on blob-pebbling price on pyramids from Theorem 110.81 We note that the parameter Ck 
in Theorem 110.191 can easily be improved. However, our main concern here is not optimality of 
constants but clarity of exposition. 

We prove Theorem 110.191 by applying the preprocessing in the previous subsection and then 
(almost) reducing the problem to the standard black-white pebble game. However, some twists 
are added along the way since our potential measure for blobs behave differently from Klawe's 
potential measure for black and white pebbles. Let us first exemplify two problems that arise if we 
try to do naive pattern matching on Klawe's proof for the standard black-white pebble game. 

In the standard black-white pebble game, if U is a minimum-measure hiding set for P = 
(B, W), Lemma [9. 181 tells us that we can assume without loss of generality that U U W is tight. 
This is not true in the blob-pebble game, not even after the transformations in Section [TO. 3 1 

Example 10.20. Consider the configuration § = {[wi](u2, U3), [1V4, £3] (1*4, 1*5), [x2> 2/2, z](0)} 
with blocking set U = {x2,u\,uq} in Figure [14(a) | It can be verified that U is a minimum- 
measure blocking set and that the configuration S is W-eliminated with respect to U, but the set 

U U W(S) = {ui, «2, 1*3, U4, 115, ite, 22} is not tight (because of X2). 

This can be handled, but a more serious problem is that even if the set U U W blocking the 
chain B is tight, there is no guarantee that the vertices in U U W end up in the same connected 
component of the hiding set graph TC(U U W) in Definition I9.20I 

Example 10.21. Consider the single-blob configuration § = {[its, z](0)} in Figure [14(b) | It is 
easy to verify that U = {v 4, 2/2} is a subset-minimal blocker of § and also a tight vertex set. This 
highlights the fact that blocking sets for blob-pebbling configurations can have rather different prop- 
erties than hiding sets for standard pebbles. In particular, a minimal blocking set for a single blob 
can have several "isolated" vertices at large distances from one another. Among other problems, 
this leads to difficulties in defining connected components of blocking sets for subconfigurations. 

The naive attempt to generalize Definition l9.20l of connected components in a hiding set graph 
to blocking sets would place the vertices V4 and in different connected components {^4} and 
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{1/2], none of which blocks § = {[145, z\{$)}. This is not what we want (compare Corollary 19.231 
for hiding sets for black-white pebble configurations). We remark that there really cannot be any 
other sensible definition that places and y 2 in the same connected component either, at least not 
if we want to appeal to the spreading properties in Definition 19.341 Since the level difference in U 
is 3 but the size of the set is only 2, the spreading inequality (19.101 ) cannot hold for this set. 

To get around this problem, we will instead use connected components defined in terms of 
hiding the singleton black pebbles given by the bottom vertices of our blobs. For a start, recalling 
Definitions 19. 61 and 1 1 0. 1 [ let us make an easy observation relating the hiding and blocking relations 
for a blob. 

Observation 10.22. If a vertex set V hides some vertex b £ B, then V blocks B. 

Proof. If V blocks all paths visiting b, then in particular it blocks the subset of paths that not only 
visits b but agree with all of B. □ 

We will focus on the case when the bottom vertex of a blob is hidden. 

Definition 10.23 (Hiding blob-pebbling configurations). We say that the vertex set U hides the 
subconfiguration [5](W) if U U W hides the vertex bot(B), and that U hides the blob-pebbling 
configuration S if U hides all [B]{W) G S. 

If U does not hide [B]{W), then U blocks [B](W) only if U n G^ ot(B) does. 

Proposition 10.24. Suppose that a vertex set U in a layered DAG G blocks but does not hide the 
subconfiguration \B\{W) and that [B](W) does not block itself. Then U n G^^ does not block 
[B](W), but there is a subset U' QU n G^ot^s) that blocks i B ]( w )- 

Proof. Suppose that U U W blocks B but does not hide b = bot(77), and that W does not block B. 
Then there is a source path P2 via B such that P2 n W = 0. Also, there is a source path Pi to b 
such that Pi n (U U W) = 0. Let P = (Pi n G b A ) U (P 2 n Gj) be the source path that starts 
like Pi and continues like P2 from b onwards. Clearly, 

p n ((£/ n gQ u w) = (Pi n (u u w)) u (p 2 n w) = (io.ii) 

so U n does not block [B](W). 

Suppose that U D G% does not block [B](W). Since U U W does not hide b, there is some 
source path P x to b with P x D (U U W) = 0. Also, since U U W blocks B but (U n Gj) U does 
not, there is a source path P 2 via P such that P 2 n (Z7 U W) 7^ but P 2 D (<7 U W) n G% = 0. 
But then let P = (P x n G^) U (P 2 n G%) be the source path that starts like Pi and continues 
like P 2 from 6 onwards. We get that P agrees with B and that P n (£/ U W) = 0, contradicting 
the assumption that U blocks [.B](W). □ 

We want to distinguish between subconfigurations that ai - e hidden and subconfigurations that 
are just blocked, but not hidden. To this end, let us introduce the notation 

S H (S,U) = {[B](W) e§|/7hides [B}(W)} (10.12) 

to denote the subconfigurations in S hidden by U and 

S B (S,U) =S\Sh(S,Z7) (10.13) 

to denote the subconfigurations that are just blocked. We write 

Bh(S, U) = (bot(P) I [B](W) e S H (S, 17)} ( 10 - 14 ) 
B B (S, U) = {bot(P) I [S](W) G S B (S, 17)} (10.15) 
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(a) {[S4,yi,z](v 2 ), [U 3 , W 3 ](s 3 ), [w 4 ,X 3 ]{v 5 )}. (b) {[s4,V4,W 3 ,X 3 ,y 2 ]{<l)), [W2,yi]{s 3 ,U3,Xl), [w 4 ]{^5>}- 

Figure 15: Examples of blob-pebbling configurations with hidden and just blocked blobs. 



to denote the black bottom vertices in these two subsets of subconfigurations and note that we can 
have U) n U) / 0. The white pebbles in these subsets located below the bottom 

vertices of the black blobs that they are supporting are denoted 



and 



W&(S,U) = {W n G b A \[B]{W) G §h(§,*7), 6 = bot(5)} 
W£(§, U) = {W n Gl\[B](W) G Ss(S, 17), 6 = bot(S)} . 



(10.16) 
(10.17) 



This notation will be used heavily in what follows, so we give a couple of simple but hopefully 
illuminating examples before we continue. 

Example 10.25. Consider the blob-pebbling configurations and blocking sets in Figure [15] For 
the blob-pebbling configuration Si = { [54, y\, z\(v2), [u 3 , 103] (S3), [104, x 3 ](v5) } with blocking 
set U\ = {t>3, V4} in Figure [T5(a)| the vertex set {^4,^5} hides W4 = bot([u>4, x 3 ]) but [S4, yi,z] 
is blocked but not hidden by {^2,^3,^4} and [U3, 1^3] is blocked but not hidden by {v 3 }. Thus, we 
have 





Ui) 


= {[m,x 3 ](v 5 }} 


S B (Si 


Ux) 


= {[s4,yi,z}(v 2 ), [u 3 , w 3 ](s 3 )} 


B H $i 


Ui) 


= {m} 


B b (Si 


Ui) 


= {S4,U 3 } 


W^(Si 


Ux) 


= w 




Ui) 


= {s 3 } 



in this example. For the configuration S 2 = {[s4,V4,,w 3 , x 3 ,y 2 }{®) , [^2, yi](s3, ""3, 21), [104] (v 5 )} 
with blocker U2 = {s2, U4, U5} in Figure [T5(b)[ it is straightforward to verify that 

§#(§2,^2) = {[W2,yi](s 3 ,n 3 ,xi), [W4,}(V 5 }} 

§b(§2,£7>) = {[S4,f4,^3,^3,y2](0)} 

Sb(§2, £7 2 ) = {1^2,^4} 
Sb(S 2 ,C/ 2 ) = {s 4 } 

W^(S 2 ,C/2) = {S3,«3^5} 

W^(S 2 ,C/ 2 ) = 
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are the corresponding sets. 

Let us also use the opportunity to illustrate Definition 110.151 The blob-pebbling configuration 
Si is not W-eliminated with respect to U\, since U\ also blocks this configuration with the white 
pebble on S3 removed. However, a better idea measure-wise is to change the blocking set for Si to 
U[ = {54, V4}, which has measure m(U{) = 4 < 6 = m(Ui). The vertex set U2 can be verified to 
be a minimum-measure blocker for S2, but when S2 is W-eliminated with respect to C/ 2 the white 
pebble on x\ disappears. 

As a final remark in this example, we comment that although we have not indicated explicitly 
in Figures |15(a)| and |15(b)| which white pebbles W are associated with which black blob B (as 
was done in Figure |14(a)| ), this is uniquely determined by the requirement in Definition 16.71 that 
W C lpp(B). 

For the rest of this section we will assume without loss of generality (in view of Proposi- 
tion 110.111 and Corollary 110.171) that we are dealing with a blob-pebbling configuration S and a 
minimum-measure blocker UofS such that S is free from self -blocking subconfigurations and is 
W-eliminated with respect to U. As an aside, we note that it is not hard to show (using Defini- 
tion \IQA5\ and Proposition 1 10.241) that this implies that W^(S, U) = 0. We will tend to drop the 
arguments S and U for Sjj,^b,Bh,Bb, Wjj, and W^, since from now on the blob-pebbling con- 
figuration S and the blocker U will be fixed. With this notation, Theorem 110.191 clearly follows if 
we can prove the following lemma. 

Lemma 10.26. Let S be any blob-pebbling configuration on a layered spreading DAG and U be 
any blocking set for S such that 

1. pot(S) = m(U), i.e., U is a minimum-measure blocker ofE>, 

2. S is free from self-blocking subconfigurations and is W-eliminated with respect to U, and 

3. U has minimal size among all blocking sets U' for S such that pot(S) = m(U'). 
Then \U\ < 13 • \B H U B B U W£|. 

The proof is by contradiction, although we will have to work harder than for the corresponding 
Theorem l9. 25 I for black- white pebbling and also use (the proof of) the latter theorem as a subroutine. 
Thus, for the rest of this section, let us assume on the contrary that U has all the properties stated in 
Lemma [Ta26l but that \U\ > 13- \B H U B b U W#|. We will show that this leads to a contradiction. 

For the subconfiguration in S# that are hidden by U, one could argue that matters should be 
reasonably similar to the case for standard black-white pebbling, and hopefully we could apply 
similar reasoning as in Section l931 to prove something useful about the vertex set hiding these sub- 
configurations. The subconfigurations in S# that are just blocked but not hidden, however, seem 
harder to get a handle on (compare Example 1 10.2 II) . 

Let Uh Q U be a smallest vertex set hiding S# and let Ub = U \ Uh- The set Ub consists 
of vertices that are not involved in any hiding of subconfigurations in S>h, but only in blocking 
subconfigurations in S^ on levels above their bottom vertices. As a first step towards proving 
Lemma [10. 26 1 and thus Theorem 1 10. 191 we want to argue that U b cannot be very large. 

Consider the blobs in S>b- By definition they are not hidden, but are blocked at some level above 
level(bot(5)). Since the vertices in Ub are located on high levels, a naive attempt to improve the 
blocking set would be to pick some u G Ub and replace it by the vertices in Bb corresponding 
to the subconfigurations in S# that u is involved in blocking, i.e., by the set B u = {bot(B)u7 \ 
{u} does not block G S#}- Note that B u is lower down in the graph than u, so (U\{u}) U 

B u is obtained from U by moving vertices downwards and by construction (U\{u}) U B u blocks S. 
But by assumption, U has minimal potential and cardinality, so this new blocking set cannot be an 
improvement measure- or cardinality-wise. The same holds if we extend the construction to subsets 
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U' C Ub and the corresponding bottom vertices B u ' C Bb- By assumption we can never find any 
subset such that (U \ {U'}) U B u is a better blocker than U. It follows that the cost of the blobs 
that Ub helps to block must be larger than the size of Ub, and in particular that \Ub\ < \Bb\- Let 
us write this down as a lemma and prove it properly. 

Lemma 10.27. Let § be any blob-pebbling configuration on a layered DAG and U be any blocking 
set for § such that pot(S) = m(U), U has minimal size among all blocking sets U' for § with 
pot(S) = m(U'), and § is free from self-blocking subconfigurations and is W -eliminated with 
respect to U. Then ifU B ^ U is any smallest set hiding S B and Ub = U \Uh, it holds that 
\U B \ < \B B \. 

Before proving this lemma, we note the immediate corollary that if the whole blocking set 
U is significantly larger than COSt(S), the lion's share of U by necessity consists not of vertices 
blocking subconfigurations in S B , but of vertices hiding subconfigurations in And recall that 
we are indeed assuming, to get a contradiction, that U is large. 

Corollary 10.28. Assume that Sand U are as in Lemma \l0.26\ but with \ U\ > 13- \Bh UBbU W# . 
Let U H QU be a smallest set hiding S H . Then \U H \ > 12 - \B H U B B U W H \. 

As was indicated in the informal discussion preceding Lemma 110.271 the proof of the lemma 
uses the easy observation that moving vertices downwards can only decrease the measure. 

Observation 10.29. Suppose that U, V% and V 2 ar e vertex sets in a layered DAG such that U H V 2 = 
and there is a one-to-one {but not necessarily onto) mapping f : V\ 1— ► V 2 with the property that 
level (v) < level(/(w)). Then m(U U Vf) < m(U U V 2 ). 



Proof. This follows immediately from Definition |9. 8 on page 48 since the mapping / tells us that 



\(uuvi){hj}\ < |f/i>j}| + |Mhj}| < l^i}| + l/(M^i})l 

< \u{yj}\ + \v 2 {yj}\ < \(u u v 2 ){hj}\ 

for all j. □ 

Proof of Lemma \1027} Note first that by Proposition [10241 for every [B](W) E E>b with b = 
bot(B) it holds that U n = (U H U U B ) n G% blocks [B]{W). Therefore, all vertices in U B 
needed to block [B](W) can be found in Ub H G%. Rephrasing this slightly, the blob-pebbling 
configuration S is blocked by Uh U (U B n UfeGB s )' an ^ smce U is subset-minimal we get that 

U B = U B n \JbeB B G b ■ (10.18) 

Consider the bipartite graph with Bb and U B as the left- and right-hand vertices, where the neigh- 
bours of each b G Bb are the vertices N(b) = Ub H in [7^ above 6. We have that N(Bb) = 
Ub n \J beBB G% = U B by (fToTBl . Let B' Cfi B bea largest set such that \N(B') \ < \B'\. If 
B' = Bb we are done since this is the inequality \Ub\ < \&b\- Suppose therefore that B' ^ Bb 
and \U B \ = \N(B B )\ > \B B \. 

For all B" CB B \B' we must have | N [B") \N(B') | > \B"\, for otherwise B" could be added 
to B' to yield an even larger set B* = B' U £>" with |iV"(B*) | < \B*\ contrary to the assumption 
that B' has maximal size among all sets with this property. It follows by Hall's marriage theorem 
that there must exist a matching of B B \ B' into N[B B \ B') \ N(B') = U B \ N(B'). Thus, 
\B B \ B'\ < j U B \ N (B 1 ) | and in addition it follows from the way our bipartite graph is constructed 
that every b G Bb \ B' is matched to some u G U B \ N(B') with level (u) > level (fe). 

Clearly, all subconfigurations in 

= {[B]{W) G S B | bot(5) G B B \ B'} (10.19) 
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are blocked by Bb \B' (even hidden by this set, to be precise). Also, as was argued in the beginning 
of the proof, every [B]{W) eS B with b = bot(B) is blocked by U H U (U B D G%) = U H UN(b), 
so all subconfigurations in 

S| = {[B](W) E S B | bot(B) e B'} (10.20) 

are blocked by Uh U iV(^') where |A r (^')| < |£>'|. And we know that E>h is blocked (even 
hidden) by Uh- It follows that if we let 

U* = U H U iV(B') U (B B \ B') (10.21) 

we get a vertex set 17* that blocks S>h U §}j U S b = S, has measure m(U*) < m(U) because of 
Observation 1 1 0. 29 1 and has size 

\U*\ < \U H \ + \N{B')\ + \B B \B'\ < \U H \ + \B'\ + |fl B = \U\ (10.22) 

strictly less than the size of U. But this is a contradiction, since U was chosen to be of minimal 
size. The lemma follows. □ 

The idea in the remaining part of the proof is as follows: Fix some smallest subset Uh Q U 
that hides and let U B = U \ Uh- Corollary 1 10.281 says that Uh is the totally dominating part 
of U and hence that Uh is very large. But Uh hides the blob subconfigurations in S# very much 
in a similar way as for hiding sets in the standard black-white pebble game. And we know from 
Section 19.31 that such sets need not be very large. Therefore we want to use Klawe-like ideas to 
derive a contradiction by transforming Uh locally into a (much) better blocking set for The 
problem is that this might leave some subconfigurations in not being blocked any longer (note 
that in general Ub will not on its own block §#). However, since we have chosen our parameter 
Ck = 13 for the Generalized LHC property 110.71 so generously and since the transformation in 
Section 1931 works for the (non-generalized) LHC property with parameter 1, we expect our locally 
transformed blocking set to be so much cheaper that we can afford to take care of any subconfigu- 
rations in §b that are no longer blocked simply by adding all bottom vertices for all black blobs in 
these subconfigurations to the blocking set. 

We will not be able to pull this off by just making one local improvement of the hiding set as 
was done in Section 1931 though. The reason is that the local improvement to Uh could potentially 
be very small, but lead to very many subconfigurations in Ss becoming unblocked. If so, we 
cannot afford adding new vertices blocking these subconfigurations without risking to increase the 
size and/or potential of our new blocking set too much. To make sure that this does not happen, 
we instead make multiple local improvements of Uh simultaneously. Our next lemma says that we 
can do this without losing control of how the measure behaves. 

Lemma 10.30 (Generalization of Lemma l9.30l) . Suppose that U\, . . . , Uk, V%, . . . , Vk, Y are ver- 
tex sets in a layered graph such that for all i, j £ [k], i ^ j, it holds that U{ Vi, V{ D Vj = 0, 
Ui n Vj = and Y n V t = 0. Thenm(Y U ULi &i) < m(Y U ULi Vi). 

Proof. By induction over k. The base case k = 1 is Lemma [9.30 on page 56| 

For the induction step, let Y' = Y U Ufci u i- Since u k V k and Y' l~l Vk = by 
assumption, we get from Lemma |9.30| fhat 

m[Y U Uti Ui) =m(Y'u U k ) < m(Y' U V k ) = m(Y U U£i U i u V k) • ( 10 - 23 ) 

Letting Y" = Y U 14, we see that (again by assumption) it holds for alH, j G [k — 1], i ^ j, that 
Ui ^ m Vi, Vi n Vj = 0, Ui n Vj = and Y" D Vi = 0. Hence, by the induction hypothesis we 
have 

m{Y U Uti U i u Vk) =™{Y" U U*-\ Ui) <m(Y" U U=\ V) =m(7U Uti V) (10.24) 
and the lemma follows. □ 
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We also need an observation about the white pebbles in 

Observation 10.31. For any [B](W) € 8 H with b = bot(B) it holds thatW = Wf] G b A . 

Proof. This is so since § is W-eliminated with respect to U. Since U U W hides b = bot(B), 
any vertices in W n G^ are superfluous and will be removed by the W-elimination procedure in 
Definition UOlSl □ 

Recalling from (TT07T6l> that W H = {W n Gft| G b = bot(B)} this leads to the 

next, simple but crucial observation. 



Observation 10.32. The vertex set Uh U hides the vertices in Bh in the sense of Definition 

That is, we can consider (B H , W H ) to be (almostf] a standard black-white pebble configura- 
tion. This sets the stage for applying the machinery of Section 1931 

Appealing to Lemma 9.18 on page 52[ let X C Uh U W h be the unique, minimal tight set such 

that 

P1 = ^UW^] (10.25) 

and define 

W£ = W H n X (10.26a) 
U T = U H n X (10.26b) 

to be the vertices in W H and Uh that remains in X after the bottom-up pruning procedure of 
Lemma |9T8l 

Let H = H(G, X) be the hiding set graph of Definition [97201 for X = U T 0W£. Suppose that 
Vx, ■ ■ ■ , Vk are the connected components of Tt, and define for i = 1, . . . , k the vertex sets 

B H = B H n V t (10.27a) 
W H = W H n V- (10.27b) 
U l H = U H n ^ (10.27c) 

to be the black, white and "hiding" vertices within component Vi, and 

W4 = w£ n V< (10.27d) 
C4 = J7 T n V, (10.27e) 

to be the vertices of W H and Uh in component V^ that "survived" when moving to the tight sub- 
set X. Note that we have the disjoint union equalities W£ = [] k l=1 W H , U H = U*=i^. et cetera 
for all of these sets. 

Let us also generalize Definition 19.81 of measure and partial measure to multi-sets of vertices 
in the natural way, where we charge separately for each copy of every vertex. This is our way of 
doing the bookkeeping for the extra vertices that might be needed later to block in the final step 
of our construction. 

This brings us to the key lemma stating how we will locally improve the blocking sets. 

Lemma 10.33 (Generalization of Lemma l9.36b . With the assumptions on the blob-pebbling con- 
figuration S and the vertex set U as in Lemma \10.26\ and with notation as above, suppose that 
U' H U Wjj hides B H , that Ti.(U^ U W^) is a connected graph, and that 

\U H \ >6 - \B H U W H \ ■ (10.28) 



9 Not quite, since we might have Bh n Wh 7^ 0- But at least we know that Uh n Wh = by W-elimination and 
the roles of U and W in U U W are fairly indistinguishable in Klawe's proof anyway, so this does not matter. 
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Then we can find a multi-set XJ\ C [[[/j, U W^"f| f/m? /j/Jes vertices in B l H , has I |C^"|/3| extra 
copies of some fixed but arbitrary vertex on level L\j = maxlevel([/]^), and satisfies XJ\ U 



11 



and \ Ul\ < | Ujj | ( where U\ is measured and counted as a multi-set with repetitions). 



Proof. Let U\ be the set found in Lemma 9.33 on page 57 1 which certainly is in \U\. U W^H, 
together with the prescribed extra copies of some (fixed but arbitrary) vertex that we place on level 
maxlevel ( \U % n U W^fl) > Ljj to be on the safe side. By Lemma 19331 U\ hides B % H , and the size 
of U\ counted as a multi-set with repetitions is 



\vi\ < |^| + [\u l H \/z\ <(| + |)- \vh\ < 



(10.29) 



It remains to show that U\ ^ m U l H . 

The proof of this last measure inequality is very much as in Lemma l9.36l but with the distinction 
that the connected graph that we are dealing with is defined over U^UW^, but we count the vertices 
in U % H U W H . Note, however, that by construction these two unions hide exactly the same set of 
vertices, i.e., 

P4 u = Wh u W h J . (io.30) 



Recall that by Definition 9.29 on page 56| what we need to do in order to show that U\ 



W H is 



to find for each j an / < j such that m? \U\ 
into two cases. 



< m (Ujr). As in Lemma l9.36l we divide the proof 



1. If j < minlevel([4 U W l T ) = minlevel^ U W H ), we get 



mi(Ul)=j + 2 

< i + 2 

< j + 2 

< j + 2 
= j + 2 
= m j (U 



\Ul{hj}\ 

\ u i\ 

(l^kl + [l^l/3j) 
\ U h\ 

\Uh{hj}\ 



HJ 



by definition of m-'(-) ] 

since V{>: j} C V for any V ] 

by Lemma l9.33l plus extra vertices ] 

by the assumption in (110.281 ) ] 

Uff{h j} = U\j since j < mmleve^f/j^) 
by definition of m 3 {■) ] 



and we can choose / = j in Definition 19.291 



2. Consider instead j > minlevel([/^ U VU^.) and let L = minleve^C/^ U Wy). Since the 
black pebbles in B l H are hidden by Uj, U W^, i.e., B % H C \\U^ U Wj^\ in formal notation, 
recollecting Definition 19. 3 1 1 and Observation 19.321 part|2j we see that 



(10.31) 



for all j. Also, since C/^ U is a hiding-connected vertex set in a spreading graph G, 
combining Definition [934] with the fact that Uj, U C U l H U W]^ we can derive that 



j + Lyj^ui, u w4H) < £ + \Ut u w4^| < L + I^h u w)* I ■ 

Together, (110.311 ) and (110.321 ) say that 

j + Ly^B^) <L+\W H U 1<| 



(10.32) 



(10.33) 
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and using this inequality we can show that 

m j (Ui)=j + 2-\ut{yj}\ 

<j + L tj (B i H ) + \B\ I \ + 2.[\Ul I \/3\ 
<L+|E^UW^| + |B^|+2-L|C^|/3j 

<l + §|c4| + 1^1 + 1-^1 
<L + %\ui 



<L + 2-\U 



+ 2- \B l H U W H \ 
h\ 

L + 2-\U l H {hL}\ 



by definition of m 3 {-) ] 
by Lemma [9.331 + extra vertices 
using the inequality (110.331) ] 
\A U B\ < \A\ + \B\ ] 
\A\ + \B\ < 2- \A U B\ ] 
by the assumption in (110.281) ] 
since L < minlevel(L^) ] 
by definition of m L {-) 1 



Thus, the partial measure of U l H at the minimum level L is always at least as large as the 
partial measure of Ul at levels j above this minimum level, and we can choose I = L in 
Definition l9~29l 



Consequently, Ul ^ m U l H and the lemma follows. 



□ 



Now we want to determine in which connected components of the hiding set graph TL we should 
apply Lemma [10.331 Loosely put, we want to be sure that changing U % H to Ul is worthwhile, i.e., 
that we gain enough from this transformation to compensate for the extra hassle of reblocking blobs 
in §b that turn unblocked when we change UL. With this in mind, let us define the weight of a 
component Vi in TL as 



wiVi) 



C/jfl/6] if \U\j\ > 6- \& H U W H \ 
otherwise. 



(10.34) 



The idea is that a component Vi has large weight if the hiding set U l H in this component is large 
compared to the number of bottom black vertices in B l H hidden and the white pebbles W l H helping 
Ujj to hide B l H . If we concentrate on changing the hiding sets in components with non-zero weight, 
we hope to gain more from the transformation of U l H into Ul than we lose from then having to 
reblocking And since Ur is large, the total weight of the non-zero-weight components is 
guaranteed to be reasonably large. 

Proposition 10.34. With notation as above, the total weight of all connected components Vy, . . . ,14 
in the hiding set graph H = H(G, U T U W$) is Yll=i > \B H U Bb U W£|. 

Proof. The total size of the union of all subsets U l H C Uh with sizes \U l H \ < 6 • \B l H U 
resulting in zero-weight components Vi in TL is clearly strictly less than 



k 

J2\B { H U W H \ = 6 • \B H U Wh\ < 6 • \B H U B B U 

-i=i 



(10.35) 



Since according to Corollary [TO. 28 1 we have that \U H \ > 12 • \Bh U Bb U W^|, it follows that the 
size of the union U™(Vi)>o °f a ^ subsets U l H corresponding to non-zero-weight components Vi 
must be strictly larger than 6 • \B H U B B U W^|. But then 



w(Vi)>0 w(Vi)>0 



> 



u 



U 



H 



w(Vi)>0 



> \B H U B B U W&l 



(10.36) 



as claimed in the proposition. 



□ 
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We have now collected all tools needed to establish the Generalized limited hiding-cardinality 
property for spreading graphs. Before we wrap up the proof, let us recapitulate what we have shown 
so far. 

We have divided the blocking set U into a disjoint union Uh U Ub of the vertices Ujj not only 
blocking but actually hiding the subconfigurations in S# C S, and the vertices Ub just helping Ujj 
to block the remaining subconfigurations in §b = § \ I n Lemma [10.271 and Corollary 110.281 
we proved that if U is large (which we are assuming) then Ub must be very small compared to Uh, 
so we can basically just ignore Ub- If we want to do something interesting, it will have to be done 
with Uh- 

And indeed, Lemma [10.331 tells us that we can restructure Uh to get a new vertex set hiding 
§>H and make considerable savings, but that this can lead to §g no longer being blocked. By 
Proposition 110.341 there is a large fraction of Uh that resides in the non-zero-weight components of 
the hiding set graph TL (as defined in Equation (I10.34I )). We would like to show that by judiciously 
performing the restructuring of Lemma [T 0.33 1 in these components, we can also take care of 

More precisely, we claim that we can combine the hiding sets U\ from Lemma 110.331 with 
some subsets of Uh U Ub and Bb into a new blocking set U* for all of §# U Sb = S in such 
a way that the measure m(U*) does not exceed m(U) = pot(§) but so that \U*\ < \U\. But 
this contradicts the assumptions in Lemma [10.261 It follows that the conclusion in Lemma [10.261 
which we assumed to be false in order to derive a contradiction, must instead be true. That is, 
any set U that is chosen as in Lemma [10. 26 1 must have size \U\ < 13 • \Bh U Bb U W|h. This 
in turn implies Theorem 1 10. 191 i.e., that layered spreading graphs possess the Generalized limited 
hiding-cardinality property that we assumed in order to get a lower bound on blob-pebbling price, 
and we are done. 

We proceed to establish this final claim. Our plan is once again to do some bipartite match- 
ing with the help of Hall's theorem. Create a weighted bipartite graph with the vertices in Bb = 
{bot(-B)| E §b} on the left-hand side and with the non-zero-weight connected compo- 

nents among V\, ■ ■ ■ , \% in TL in the sense of (110.341 ) acting as "supervertices" on the right-hand 
side. Reorder the indices among the connected components V\ , ■ ■ ■ , if needed so that the non- 
zero-weight components are V\ , . . . , V&'. All vertices in the weighted graphs are assigned weights 
so that each right-hand side supervertex Vi gets its weight according to (1 10.34b . and each left-hand 
vertex has weie htl0 We define the neighbours of each fixed vertex b E Bb to be 

N(b) = {Vi\w(Vi) > and maxlevel > level(b)} , (10.37) 

i.e., all non-zero- weight components Vi that contain vertices in the hiding set Uh that could possibly 
be involved in blocking any subconfiguration [B](W) £ having bottom vertex bot(B) = b. 
This is so since by Proposition 110.241 any vertex u E Uh helping to block such a subconfiguration 
[B] (W) E §b must be strictly above b, so if the highest-level vertices in U l H are on a level below b, 
no vertex in U H can be responsible for blocking [B](W). 

Let B' C B B be a largest set such that w(N(B')) < \B'\. We must have 

N(B')^{jtiVi (10.38) 

since w([jf =1 V { ) > |fe U B B U > \B B \ by Proposition EOU For all B" QB B \B' it 

holds that 

w(N(B") \ N(B')) > \B"\ (10.39) 

since otherwise B' would not be of largest size as assumed above. The inequality (110.391 ) plugged 
into Hall's marriage theorem tells us that there is a matching of the vertices in Bb \ B' to the 

I0 Or, if we like, we can equivalently think of an unweighted graph, where each Vi is a cloud of w(Vi) unique and 
distinct vertices, and where N(b) in d!0.37t always containing either all or none of these vertices. 
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components in (Ji=i \ N(B') / with the property that no component Vi gets matched with 
more than w(Vi) vertices from Bb\B' . 

Reorder the components in the hiding set graph TL so that the matched components in Ti 
are V\, . . . , V m and the rest of the components are V m+ i, . . . , \% and so that U H , . . . , U^ and 
Uff +1 , . . . , U H are the corresponding subsets of the hiding set Uh- Then pick good local blockers 
Ul C Vi as in Lemma [l0.33l for all components V\, . . . , V m . Now the following holds: 

1. By construction and assumption, respectively, the vertex set U£i ^* u Ui= m +i blocks 
(and even hides) 

2. All subconfigurations in 

= |[-B](iy) G S_b| bot(S) G B'} (10.40) 

are blocked by [/g U N(B') = Ub U U^ =m+1 f/Jj, as we have not moved any elements 
in [7 above 

3. With notation as in Lemma 110.301 let Y = U B U Ui= m +i and consider ^ and 

for i = 1, . . . , m. We have Ul ;j m for i = 1, . . . , m by Lemma fl 0.33 1 Also, since 
^Zff n C/b = and f7* C Vi and C/j^ C Vi for Vi, . . . , T4 pairwise disjoint sets of vertices, 
it holds for all i, j G [m], i ^ j, that C/J n Ui = 0, n U J H = 0, t/J n U H = and 
y Pi E/^ = 0. Therefore, the conditions in Lemma [10.301 are satisfied and we conclude that 

m(u B u U£i ^ u Ut m+ i e&) = m(y u U^i &2) 

< m(y U U™i ^ff) 

" V Ul " X H; , (10.41) 

u ur=i ^ u ut m+ i uh) 



m 
m(U) 



where we note that Ub U U2=i ^* U Uj=m+l * s meas ured as a multi-set with repetitions. 
Also, we have the strict inequality 

\Ub U U"i ^ U Ut m+ i < \U\ , (10.42) 
where again the multi-set is counted with repetitions. 
4. It remains to take care of the potentially unblocked subconfigurations in 

§ B = {[B](W) GS B |bot(B) eB B \B'} . (10.43) 

But we derived above that there is a matching of Bb \ B' to Vi, . . . , V m such that no Vi is 
chosen by more than 

«W) = \\Uh\/S] < [\Uh\/3\ (10.44) 

vertices from Bb \ B' (where we used that \U H \ > 6 if w(Vi) > to get the last inequality). 
This means that there is a spare blocker vertex in U* for each b G Bb \ B' that is matched 
to Vi. Also, by the definition of neighbours in our weighted bipartite graph, each b is matched 
to a component with maxlevel(E7j^) > level (b). By Observation 1 1 0. 29 1 lowering these spare 
vertices from m&xlevel(U H ) to level(b) can only decrease the measure. 

Finally, throw away any remaining multiple copies in our new blocking set, and denote the resulting 
set by U*. We have that U* blocks § and that m(U*) < m(U) but \U*\ < \U\. This is a 
contradiction since U was chosen to be of minimal size, and thus Lemma [10.261 must hold. But 
then Theorem 110.191 follows immediately as well, as was noted above. 
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10.5 Recapitulation of the Proof of Theorem LUJ and Optimality of Result 

Let us conclude this section by recalling why the tight bound on clause space for refuting pebbling 
contradictions in Theorem [TTTJ now follows and by showing that the current construction cannot be 
pushed to give a better result. 

Theorem 10.35 (rephrasing of Theorem [TTTJ. Suppose that Gh is a layered blob-pebblable DAG 
of height h that is spreading. Then the clause space of refuting the pebbling contradiction Peb Gh 
of degree d > 1 by resolution is Sp(Peb Gh h 0) = 0(h). 

Proof. The 0(h) upper bound on clause space follows from the bound Peb(Gh) < h + 0(1) on 
the black pebbling price in Lemma 9.2 on page 45 combined with the bound Sp (Peb G h 0) < 



PebiG) + 0(1) from Proposition 4.15 on page 17 



For the lower bound, we instead consider the pebbling formula *Peb G without target ax- 



ioms x(z) 1 , . . . , x(z) d and use that by Lemma 7.1 on page 30] it holds that Sp {Peb Gh h 



Sp(*Peb Gh h Vi=i x(z)i). Fix any resolution derivation vr : *Peb Gh h \jf =1 x(z)i and let V n be 



the complete blob-pebbling of the graph G associated to it in Theorem |7.3 on page 31 such that 



COSt(V w ) < maxce7r{cos?(S(C))} + 0(1). On the one hand, Theorem |8.5 on page 41| says that 
COSt(8(C)) < |C| provided that d > 1, so in particular it must hold that COSt^V^) < Sp(ir)+0(l). 
On the other hand, COStiV^) > Blob-Peb(G h ) by definition, and by Theorems MM and [Tall] it 
holds that Blob-Peb(G h ) = Q(h). Thus Sp(ir) = tt(h), and the theorem follows. □ 

Plugging in pyramid graphs 11^ in Theorem 1 10.351 we get fc-CNF formulas F n of size 0(n) 
with refutation clause space Q(yjn). This is the best we can get from pebbling formulas over 
spreading graphs. 

Theorem 10.36. Let G be any layered spreading graph and suppose that Peb G has formula size 
and number of clauses 0(n). Then Sp(Peb G h 0) = O(yjn). 

Proof. Suppose that G has height h. Then Sp(Peb G h 0) = 0(h) as was noted above. The size 
of Peb G , as well as the number of clauses, is linear in the number of vertices |V(G)|. We claim 
that the fact that G is spreading implies that |F(G)| = S7(/i 2 ), from which the theorem follows. 

To prove the claim, let Vl denote the vertices of G on level L. Then |V(G)| = ^L=ol^l- 
Obviously, for any L the set Vl hides the sink z of G. Fix for every L some arbitrary minimal subset 
V' L C Vi hiding z. Then V' L is tight, the graph T~C(V L ) is hiding-connected by Corollary 19.231 
and setting j = h in the spreading inequality (19.101) we get that \V[\ > 1 + h — L. Hence 

\v(G)\>E h L =o\vL\ = n(h 2 ). " " " □ 

The proof of Theorem 110.361 can also be extended to cover the original definition in ll37l of 
spreading graphs that are not necessarily layered, but we omit the details. 



11 Conclusion and Open Problems 

We have proven an asymptotically tight bound on the refutation clause space in resolution of peb- 
bling contr adictions over pyramid graphs. This yields the currently best known separation of length 
and clause space in resolution. Also, in contrast to previous polynomial lower bounds on clause 
space, our result does not not follow from lower bounds on width for the corresponding formulas. 
Instead, a corollary of our result is an exponential improvement of the separation of width and space 
in ll42l . This is a first step towards answering the question of the relationship between length and 
space posed in, for instance, |[TTl[29ll57l . 
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More technically speaking, we have established that for all graphs G in the class of "layered 
spreading DAGs" (including complete binary trees and pyramid graphs) the height h of G, which 
coincides with the black-white pebbling price, is an asymptotical lower bound for the refutation 
clause space Sp (Pet^ h 0) of pebbling contradictions Peh d G provided that d > 2. Plugging in 
pyramid graphs we get an Q,(^/n) bound on space, which is the best one can get for any spreading 
graph. 

An obvious question is whether this lower bound on clause space in terms of black-white peb- 
bling price is true for arbitrary DAGs. In particular, does it hold for the family of DAGs {G n }^? =1 
in f3~TTl of size O(n) that have maximal black-white pebbling price BW-Peb(G n ) = 0(n/logn) 
in terms of size? If it could be proven for pebbling contradictions over such graphs that pebbling 
price bounds clause space from below, this would immediately imply that there are fc-CNF formulas 
refutable in small length that can be maximally complex with respect to clause space. 

Open Problem 1. Is there a family of unsatisfiable k-CNF formulas {F n }^ =1 of size 0(n) such 
that L(F n h 0) = O(n) and W(F n h 0) = 0(1) but Sp(F n h 0) = Q(n/\ogn)? 

We are currently working on this problem, but note that these DAGs in ll3li seem to have much 
more challenging structural properties that makes it hard to lift the lower bound argument from 
standard black-white pebblings to blob-pebblings. 

A second question, more related to Theorem 11.31 and the other trade-off results presented in 
Section [5J is as follows. We know from lfT5l (see Theorem 14.21 ) that short resolution refutations 
imply the existence of narrow refutations, and in view of this an appealing proof search heuristic is 
to search exhaustively for refutations in minimal width. One serious drawback of this approach is 
that there is no guarantee that the short and narrow refutations are the same one. On the contrary, 
the narrow refutation n' resulting from the proof in lfT"5l is potentially exponentially longer than the 
short proof it that we start with. However, we have no examples of formulas where the refutation in 
minimum width is actually known to be substantially longer than the minimum-length refutation. 
Therefore, it would be valuable to know whether this increase in length is necessary. That is, is 
there a formula family which exhibits a length-width trade-off in the sense that there are short 
refutations and narrow refutations, but all narrow refutations have a length blow-up (polynomial or 
superpolynomial)? Or is the exponential blow-up in lfT31 just an artifact of the proof? 

Open Problem 2. If F is a k-CNF formula over n variables refutable in length L, is it true that 
there is always a refutation n of F in width W(ir) = 0(^/n log with length no more than, say, 
L(tt) = O(L) or at most poly(Z) ? 

A similar trade-off question can be posed for clause space. Given a refutation in small space, 
we can prove using (5J (see Theorem 14.51 ) that there must exist a refutation in short length. But 
again, the short refutation resulting from the proof is not the same as that with which we started. 
For concreteness, let us fix the space to be constant. If a polynomial-size /c-CNF formula has a 
refutation in constant clause space, we know that it must be refutable in polynomial length. But can 
we get a refutation in both short length and small space simultaneously? 

Open Problem 3. Suppose that {F n }°° =1 is a family of polynomial-size k-CNF formulas with 
refutation clause space Sp{F n h 0) = 0(1). Does this imply that there are refutations Tr n : F n hO 
simultaneously in length L(ir n ) = poly(n) and clause space Sp{ir n ) = 0(1)? 

Or can it be that restricting the clause space, we sometimes have to end up with really long 
refutations? We would like to know what holds in this case, and how it relates to the trade-off 
results for variable space in (33j. 

Finally, we note that all bounds on clause space proven so far is in the regime where the clause 
space Sp(ir) is less than the number of clauses \F\ in F. This is quite natural, since the size of the 
formula can be shown to be an upper bound on the minimal clause space needed 11281 . 
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Such lower bounds on space might not seem too relevant to clause learning algorithms, since 
the size of the cache in practical applications usually will be very much larger than the size of the 
formula. For this reason, it seems to be a highly interesting problem to determine what can be said 
if we allow extra clause space. Assume that we have a CNF formula F of size roughly n refutable 
in length L(F h 0) = L for L suitably large (say, L = poly(n) or L = n logn or so). Suppose 
that we allow clause space more than the minimum n + 0(1), but less than the trivial upper bound 
Lj log L. Can we then find a resolution refutation using at most that much space and achieving at 
most a polynomial increase in length compared to the minimum? 

Open Problem 4 (|12|). Let F be any CNF formula with \F\ = n clauses (or \ Vars(F)\ = n 
variables). Suppose that L(F h 0) = L. Does this imply that there is a resolution refutation 
tt : F\~ in clause space Sp(ir) = O(n) and length L(ir) = poly(L)? 

If so, this could be interpreted as saying that a smart enough clause learning algorithm can 
potentially find any short resolution refutation in reasonable space (and for formulas that cannot be 
refuted in short length we cannot hope to find refutations efficiently anyway). 

We conclude with a couple of comments on clause space versus clause learning. 

Firstly, we note that it is unclear whether one should expect any fast progress on Open Prob- 
lem|4j at least if if our experience from the case where Sp(ir) < \F\ is anything to go by. Proving 
lower bounds on space in this "low-end regime" for formulas easy with respect to length has been 
(and still is) very challenging. However, it certainly cannot be excluded that problems in the range 
Sp(n) > \F\ might be approached with different and more successful techniques. 

Secondly, we would like to raise the question of whether, in spite of what was just said before 
Open Problem |4j lower bounds on clause space can nevertheless give indications as to which for- 
mulas might be hard for clause learning algorithms and why. Suppose that we know for some CNF 
formula F that Sp (F h 0) is large. What this tells us is that any algorithm, even a non-deterministic 
one making optimal choices concerning which clauses to save or throw away at any given point in 
time, will have to keep a fairly large number of "active" clauses in memory in order to carry out the 
refutation. Since this is so, a real-life deterministic proof search algorithm, which has no sure-fire 
way of knowing which clauses are the right ones to concentrate on at any given moment, might 
have to keep working on a lot of extra clauses in order to be sure that the fairly large critical set of 
clauses needed to find a refutation will be among the "active" clauses. 

Intriguingly enough, pebbling contradictions over pyramids might in fact be an example of this. 
We know that these formulas are very easy with respect to length and width, having constant-width 
refutations that are essentially as short as the formulas themselves. But in l52ll . it was shown that 
state-of-the-art clause learning algorithms can have serious problems with even moderately large 
pebbling contradictions!^] Although we are certainly not arguing that this is the whole story — 
it was also shown in ll52ll that the branching order is a critical factor, and that given some extra 
structural information the algorithm can achieve an exponential speed-up — we wonder whether the 
high lower bound on clause space can nevertheless be part of the explanation. It should be pointed 
out that pebbling contradictions are the only formulas we know of that are really easy with respect 
to length and width but hard for clause space. And if there is empirical data showing that for these 
very formulas clause learning algorithms can have great difficulties finding refutations, it might be 
worth investigating whether this is just a coincidence or a sign of some deeper connection. 
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